CONTEMPORARY 
PHYSICS 


A Journal of Interpretation and Review 


Editor: G. R. Noakes, M.A., F.Inst.P.


Editorial Board: 


Consultant Editors: Sir Lawrence Bragg, O.B.E., M.C., M.A., D.Sc., F.R.S. 
Sir George Thomson, M.A., D.Sc., F.R.S. 


Professor L. F. Bates, Ph.D., D.Sc., F.R.S.
Dr. K. A. G. Mendelssohn, F.R.S. 
Dr. F. A. Vick, O.B.E. 


Advisory Editors: 


VOLUME 3 
OCTOBER 1961 to AUGUST 1962. 


ALERE FLAMMAM


Printed and Published by 
TAYLOR & FRANCIS LTD 


RED LION COURT, FLEET STREET, LONDON, E.C.4


CONTENTS OF VOL. 3. (Nos. 1—6) 


NUMBER 1—OCTOBER 1961 


Looking at Metals: a Lecture for Schools. Dr. J. W. MARTIN 

Time and Relativity. Part I. Prof. O. R. FRISCH

Electromagnetism as a Second Order Effect. III: The Biot-Savart
Law. Dr. W. G. V. ROSSER

Cerenkov Radiation: its Origin, Properties and Applications.
Dr. J. V. JELLEY

Some Aspects of the Infra-red Absorption Spectra of Semiconductors. 
Dr. D. H. PARKINSON 

The Wool Industries Research Association. Mr. V. T. SAUNDERS

The Proceedings of the 1960 Annual International Conference on
High Energy Physics at Rochester, N.Y. Prof. A. SALAM


NUMBER 2—DECEMBER 1961 


Clean Air. Mr. S. H. G. STEVENS
Colour and Perception: the Work of Edwin Land in the Light of Current
Concepts. Mr. M. H. WILSON and Mr. R. W. BROCKLEBANK
Electroluminescence. Dr. D. W. G. BALLENTYNE 
Some Exhibits at The Royal Society Conversazione, May 1961: 
1. Determination of Wood crates Variation me 2 ea 
Absorption ae cr 
2. Kinetic Gas Analysis 
3. Graphite
4. A Propane-filled Bubble Chamber
The National Physical Laboratory: Solids at Extremely High Pressures. 
Mr. V. T. SAUNDERS... 
The Institute of Physics and The Physical Society Conference on Nuclear 
Physics, April 1961 
Report on the Fourth International Conference and Exhibition on
Medical Electronics, New York, 1961. Dr. A. NIGHTINGALE
The Rutherford High Energy Laboratory of the National Institute for 
Research in Nuclear Science 
Essay Review of Reports on Progress in Physics, Vol. 24 (1961).
Mr. D. F. GIBBS
The British Association Meeting at Norwich. Miss M. vy, GRIFFITH ... 
Rutherford in Manchester. Dr. J. E. GEAKE
Book Reviews


NUMBER 3—FEBRUARY 1962 


Physics Applied to Archaeology: Part I. Dr. M. J. AITKEN 
The American Association of Physics Teachers: Demonstration Experiments
in Physics
The Molecular Approach to Biology. Dr. D. M. BLOW
Time and Relativity: Part II. Prof. O. R. FRISCH 
The Diploma in Technology: Courses in ie ee 
Dr. N. PENTLAND
Essay Reviews 
Interferometry (The National Physical Laboratory). 
Dr. H. G. KUHN
The Fermi Surface (Ed. W. A. Harrison and M. B. Webb).
Dr. B. R. COLES
The International Conference on Cosmic Rays and the Earth Storm,
Kyoto, Japan. September 3-15, 1961. Dr. P. L. MARSDEN 


The International Magnetism Conference. Dr. E. W. LEE
The Annual Exhibition of The Institute of Physics and The Physical
Society, 15-19 January, 1962
Harwell Reactor School: Science Masters Course
Book Review


NUMBER 4—APRIL 1962 


Electrons in Metals: A Short Guide to the Fermi Surface.
Part I. The Electron Gas. Dr. J. M. ZIMAN
Some Physical Aspects of Lubrication and Wear. Dr. A. OOK... 
Two International Colour Meetings. Mr. R. W. BROCKLEBANK 
The Laws of Electromagnetism and an Electrostatic Analogue.
Dr. J. ALAN CHALMERS
Faraday’s Law of Induction and the Force on a Body due to Change in
its Magnetization in an Electric Field. Dr. K. G. RAMANATHAN
The NPL ‘Microwave Mekometer’. Mr. V. T. SAUNDERS
The Nuffield Foundation and the British Physics Teaching Project 
DSIR Research Establishments: 

The Building Research Station, Garston. Mr. V. T. SAUNDERS ... 
The British Association Granada Lectures 1961. Mr. V. T. SAUNDERS... 
Masses of Nuclei. Professor E. B. PAUL
Essay Reviews 

Proceedings of an International Conference on Instrumentation
for High-Energy Physics (Lawrence Radiation Laboratory).
Professor C. C. BUTLER
Isotope Separation. Professor E. B. PAUL
Book Reviews


NUMBER 5—JUNE 1962 
Electrons in Metals: A Short Guide to the Fermi Surface. Part II. Bands
and Zones. Dr. J. M. ZIMAN
Physics Applied to Archaeology: Part II. Dr. M. J. AITKEN


Electromagnetism for Schools, based on M.K.S.A. Units. 
Mr..E. J. Hay es sor oe 


The Physical Science Study Committee
(4) The Working Session on Physics Teaching at the Cavendish
Laboratory, Cambridge. Dr. URI HABER-SCHAIM
Ozone in the Earth’s Atmosphere. Dr. R. Frita 


The Philosophical Basis of Physics. Dr. GERD BUCHDAHL


The Science Masters’ Association Annual Meeting. Mr. D. W. HARLOW
Book Reviews


NUMBER 6—AUGUST 1962 


Electrons in Metals: A Short Guide to the Fermi Surface. Part III.
Dynamics of Bloch Electrons, and the Calculation of Band Structure.
Dr. J. M. ZIMAN
The Metre. Dr. H. BARRELL
Lasers. Professor D. J. E. INGRAM
Ampère as a Contemporary Physicist. Dr. R. A. R. TRICKER
The Royal Institution Christmas Lectures, 1961. Mr. V. T. SAUNDERS
Sciences in Communist China. Dr. K. MENDELSSOHN
Essay Review of Cosmic Rays (Part 1), the first part of Vol. 46 of the
Handbuch der Physik. Professor J. G. WILSON
Book Reviews


Index to Vol. 3 




Looking at Metals: a Lecture for Schools


by J. W. MARTIN 
Department of Metallurgy, Oxford University 


1. INTRODUCTION 


The science of metallurgy is a young one. Whereas in the chemical industry 
many processes are the early laboratory experiments done on a large scale, 
metallurgical industry has developed mainly from the older arts and crafts. 
It is only in comparatively recent times that any considerable fundamental 
work has been carried out on metals and alloys, and there are now well 
established and rapidly growing branches of science which are related to metal- 
lurgical industry in the same way that the science of chemistry is related to 
chemical industry. 


Fig. 1. Section through cast steel bar. Actual size.


About sixty-five of the chemical elements are metals, although most of them 
are combined with non-metal atoms, such as oxygen, in substances similar to 
stone or clay. The first step in making metals available for human use is freeing 
the metal atoms from this combined state: this is the province of the extractive 
metallurgist. The physical metallurgist is the one who has to know why one metal 
is different from another, and knows how to combine different metals in alloys 
in order to make them more useful. By studying their behaviour he knows 
how he can improve their properties by giving them special heat-treatments or 
by working them in rolls or under forging hammers. 




Physical metallurgy had its beginning in 1808 when Alois de Widmanstätten
polished the surface of a meteorite, and upon etching it discovered a
beautiful geometric pattern that now bears his name. Since this meteorite
was a nickel-iron alloy, it was soon deduced that metals and alloys are actually
crystalline, even though they lack the geometrical surfaces of many non-metallic
crystals. Figure 1 is a photograph (actual size) of a section through a
cast bar of alloy steel which has been ground and polished flat. Acid attack
has revealed the individual alloy crystals, which are 1-2 mm. in diameter.
Although it is possible under special conditions to grow metal crystals as large
as several centimetres in cross-section, in most familiar metal objects the
crystals (or ‘grains’) are too small to be seen with the naked eye, and it is the
microscopic structure of metals (or ‘metallography’) which will be discussed
in more detail below.


Fig. 2. The principle of the metallurgical microscope. (Diagram labels: to eyepiece; objective lens; specimen.)


2. OPTICAL METALLOGRAPHY


H. C. Sorby, of Sheffield, may be regarded as the founder of metallography. 
In 1864 he overcame the chief difficulty—the preparation of the metal surface. 
Earlier methods involved breaking the metal with a hammer blow and then 
looking at the fracture surfaces. As these are rough, and the depth of focus of 
the microscope is small, satisfactory images could not be obtained and little 
progress was made. The method developed by Sorby for preparing a metal
for examination with the microscope is, with slight modifications, still the one 
most widely used today. 

A specimen 1-2 cm. in its largest dimension is cut from the metal to be 
examined. A mirror polish is produced on one face of the specimen by grinding 
on an abrasive wheel, polishing on successively finer emery papers, and lapping 
on revolving cloth-covered wheels with fine abrasives. The polished surface 
is produced by the melting, flowing and resolidification of the surface layer due 
to heat generated from the friction between the particles of the abrasive and the 
asperities on the metal. The surface is thus not truly representative of the metal,
and to make the underlying structure visible, the flowed layer has to be removed
by ‘etching ’—usually with a dilute acid. The etchant also attacks various 
parts of the specimen at different rates and reveals the structure. 

The principle of the metallurgical microscope is illustrated in fig. 2. 
A horizontal beam of light enters an aperture in the microscope tube, where it 
strikes a glass plate (G) inclined at an angle of 45°. About 10 per cent of the 
light is reflected down through the objective lens, where it strikes the specimen 
surface and may be reflected back through the objective lens to form the primary 
image, which is then magnified further with the eyepiece. 
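
The light budget of this arrangement can be estimated with a few lines of arithmetic. The sketch below is illustrative only: apart from the 10 per cent reflection quoted above, it assumes that the glass plate absorbs nothing (so it transmits the remaining 90 per cent), that losses in the lenses are ignored, and that the specimen has a hypothetical reflectivity of 60 per cent.

```python
def fraction_reaching_eyepiece(plate_reflectance=0.10, specimen_reflectance=0.60):
    """Fraction of the incoming beam that travels back up towards the eyepiece.

    Assumptions (not from the lecture): the plate absorbs no light, lens
    losses are ignored, and specimen_reflectance is a hypothetical value.
    """
    # Whatever the plate does not reflect, it transmits.
    plate_transmittance = 1.0 - plate_reflectance
    # Light reflected down the tube on to the specimen.
    sent_down = plate_reflectance
    # Light reflected back up by the polished specimen surface.
    sent_back = sent_down * specimen_reflectance
    # On the way up, the light must pass through the inclined plate.
    return sent_back * plate_transmittance


print(f"{fraction_reaching_eyepiece():.3f}")
```

Under these assumptions only a few per cent of the original beam contributes to the image, which suggests why a bright light source is useful in such an instrument.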

On examining a polished and etched surface of a metal or a simple alloy
under the microscope, a polygonal network of grains is observed (fig. 3), similar
to that seen with the naked eye in the specimen of fig. 1. From such a structure
it is concluded that a metal consists of an agglomeration of crystals fitting together
as shown in the diagram of fig. 4. This is, of course, an idealised drawing of the
situation: each of the grains is of uniform shape and size, and of one of the few
regular solid shapes that can be used for completely filling space.


Fig. 3. Polished and etched uranium, magnified x 200. (D. M. Davies) 


Etching the polished section of fig. 3 has attacked each grain, and the sharp 
‘ grain boundaries’ between neighbouring crystals appear clearly under the 
microscope. By crushing at high temperature it is sometimes possible to make 
a metal come apart at the grain boundaries, and a photomicrograph of separated 
grains of brass (60 per cent copper-40 per cent zinc) appears in fig. 5. It is 
clear that these grains are tending towards the ideal shape (fig. 4), although 
they are far from uniform. 

When a small amount of alloying element is added to a metal, it will usually 
form a solid solution whose microstructure is indistinguishable from that of a 
pure metal, since the crystalline arrangement is the same. When larger 
amounts of alloying metal are added, however, the limit of solubility may be 
exceeded, and crystals of a different arrangement (containing atoms of each 
metal) will appear in the structure (fig. 6). The first formed solid solution is
called the primary phase, and the new crystals the secondary phase; as more and
more alloying element is added to the alloy, more of the secondary phase will be 
produced and less of the primary phase, until eventually only the secondary 
phase is present, and once more the microstructure will be the same as for a 
pure metal. Further additions of the alloying element may produce another 
phase and the sequence of changes may be repeated. 


Fig. 5. Separated grains of brass, magnified x 50.



The mechanical properties of a metal are closely related to the microstructure.
The strength increases as the grain-size is reduced, and in two-phase
alloys the strength increases as the spacing between the particles of the
second phase is reduced. The changes in microstructure that occur when a
metal is deformed will next be considered.


Fig. 6. Two-phase structure of magnesium—44% cerium alloy. 


Fig. 7. Deformation by ‘ slip ’. 


3. DEFORMATION OF METALS 


Pure metals are characteristically soft and ductile: although under slight
deformation they behave elastically and spring back to their original shape, they
are readily plastically deformed, i.e. undergo a permanent change of shape.
The study of a single metal crystal (say in the form of a wire) will show how this
plastic flow commonly occurs, and fig. 7 represents such a crystal after deformation
by extension. This type of plastic flow is called ‘slip’, since it occurs
by the slipping of crystal planes over each other, the movement taking place
along a certain crystallographic ‘slip direction’ and usually on a crystallographic
‘slip plane’. Figure 8 is a photograph of a deformed single crystal
wire of cadmium: the lines on the surface are due to the surface steps arising
from the slip process indicated in fig. 7. These lines, usually called ‘slip bands’,
will of course be readily visible only if the original surface is very smooth.


Fig. 8. Slip in a single crystal of cadmium. 


An ordinary piece of (polycrystalline) metal will also deform by the process
of slip within each individual grain, although the grain boundaries between two
differently oriented crystals are regions where slip will not readily occur. Thus
a polycrystalline specimen will be harder to deform than a single crystal, and
metal of fine grain-size will be stronger than coarse-grained metal. To demonstrate
that deformation by slip occurs in polycrystalline metal, one would first
prepare a normal metallographic section (as in fig. 3), and then plastically deform
the specimen (perhaps by squeezing in a vice). Figure 9 (a) is a typical example
of the resultant structure in the case of aluminium: on the surface of each grain
are numerous parallel slip-bands, each crystal having produced these surface
steps by a process similar to that shown in the diagram of fig. 7.

In many metals, however, a slightly more complicated slip process can occur. 
In the examples of figs. 8 and 9 (a) each crystal has only slipped on one set of 
‘ slip-planes ’, thus producing only one set of parallel slip bands on the surface. 
Slip quite commonly occurs simultaneously on more than one set of planes, and 
fig. 9(b) shows the effect on the slip-band distribution when this ‘ multiple 
slip’ has taken place. Here part of the surface of only one grain is shown, and 
slip has taken place on two sets of planes. 

Although the slip movement illustrated in fig. 7 resembles that of a pack of 
cards which has been pushed lengthwise, the actual mechanism by which slip 
takes place does not correspond to the sliding of cards. It is found that crystals 
slip under load at stresses very much below those which would be required to 
move two perfect crystal planes past one another; in fact metal crystals are 
between 1000 and 10000 times weaker than theory would predict. Nearly
thirty years ago a theory was put forward to account for this discrepancy, but
only in recent years have modern metallographic techniques brought supporting 
proof of the hypothesis. 


Fig. 9 (a), (b). Slip bands on polycrystalline aluminium: x 750. (The small triangular and
polygonal surface pits were produced by the etchant.)


4. DISLOCATIONS 


A more realistic picture of the slip process would be to visualise slip beginning 
in one small area of the plane and then spreading over the rest of the plane. 
While this is taking place, the slip plane will be divided into a slipped area and 
an unslipped area, and the line of demarcation (called a dislocation line) will be
moving across the slip plane. Professor N. F. Mott has made the amusing
simile that the passage of a dislocation line across a slip-plane of a crystal is like
the movement of a ruck in a carpet. It is obviously more difficult to make one
carpet move bodily over another by pulling one end of it, than to make a ruck
in the carpet, and to move the ruck along. The ruck separates the ‘slipped’
from the ‘unslipped’ part of the carpet in the same way that a dislocation
line separates the slipped and unslipped regions of a slip plane in a crystal.

The diagram of fig. 10 (a) represents a simple form of dislocation line EE 
lying in the slip plane ABCD of a small crystal block. When the upper part 
of the crystal is stressed as indicated by the arrow, slip takes place as shown in 
fig. 10(b) by the movement of the dislocation line to the surface at CD— 
producing a step there as in fig. 7. 


Fig. 10. A simple dislocation (EE) in a crystal block. 


The stresses required to make dislocation lines move are calculated to be
in good agreement with the measured strengths of metal crystals, the presence
of dislocations thus accounting for the weakness of such crystals. All soft
metal crystals contain such lines, the average distance between them being a
few thousand atoms. They form a network within each grain, because a dislocation
line cannot end within a crystal: it must run either to a free surface, to a grain
boundary or to another dislocation.

The process of slip increases the dislocation content of the metal. When a
metal is plastically deformed, the yield stress for further deformation increases
strongly; this effect is known as ‘work-hardening’, and is a valuable
engineering property. It occurs when dislocations moving
on intersecting slip planes meet and obstruct one another’s motion (e.g. fig.
9 (b)); a high density of dislocations then builds up at such places of intersection.
A grain boundary may act as a barrier in a similar way, and a ‘traffic jam’ of
dislocations may pile up at such an interface.

The presence of dislocations can thus account for the softness of metals,
and also for the fact that they become hard when deformed. The scale of the
phenomena associated with dislocations is far below the resolution of the optical
microscope, however, and it is the application of the electron microscope to
metallography which has, in recent years, increased our knowledge in this
field. Instruments are now commercially available which are capable of
resolving 7 Å (i.e. of discerning two points 7 x 10⁻⁸ cm apart), but before
considering the application of the electron microscope to the study of
dislocations in metals, the various ways in which it may be used by the
metallographer will be outlined.
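
To give a feeling for the scale involved, the unit conversion behind the quoted resolution can be written out as a short calculation. This is only a sketch: the 7 Å figure comes from the text, while the wavelength of 5000 Å for visible (green) light is an assumed round number introduced here for comparison.

```python
ANGSTROM_IN_CM = 1e-8  # 1 Å = 10^-8 cm


def angstrom_to_cm(length_in_angstrom):
    """Convert a length in ångström units to centimetres."""
    return length_in_angstrom * ANGSTROM_IN_CM


RESOLUTION_ANGSTROM = 7.0             # resolution quoted in the lecture
VISIBLE_WAVELENGTH_ANGSTROM = 5000.0  # assumption: typical green light

resolution_cm = angstrom_to_cm(RESOLUTION_ANGSTROM)
ratio = VISIBLE_WAVELENGTH_ANGSTROM / RESOLUTION_ANGSTROM

print(f"{resolution_cm:.1e} cm")
print(f"about {ratio:.0f} times smaller than the assumed optical wavelength")
```

On these figures the electron microscope resolves detail several hundred times finer than the wavelength of the light used in an optical instrument.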



5. ELECTRON METALLOGRAPHY 


The conventional electron microscope is of the transmission type—that is, 
an electron beam passes through the specimen to be examined and is then 
focussed by the objective and projector ‘lenses’ to produce an image on a 
fluorescent screen. A diagram of the optical system appears in fig. 11; the 
entire microscope tube is under high vacuum, and the magnetic ‘lenses’ 
which focus the electron beam consist of carefully designed electromagnets. 
Contrast in the image results from differences in scattering-power when the 
electron beam passes through the thick and thin parts of the specimen, or the 
parts having differences in atomic composition. 


Fig. 11. The electron microscope; a schematic diagram. (Diagram labels, from top to bottom: electron gun; specimen; objective lens; projector lens; fluorescent screen.)


Metallographers were therefore confronted with a difficult problem: metals 
readily absorb electrons, and it was not easy to produce suitable thin films for 
examination by transmission. The problem was solved by the development
of replica techniques. For example, an imprint of the differences in surface
height on a polished and etched metal surface may be obtained by adding a
dilute solution of a plastic material in a volatile solvent (rather like ‘nail varnish’)
to the metal surface. When the solvent evaporates a thin film of plastic is left, the
upper surface of which is flat, while the lower surface follows the contours of the
etched metal (fig. 12). This film is now stripped from the metal, placed on to a
small copper supporting grid (fig. 13), and is then placed in the electron microscope.
The contrast in the image is obtained by the differences in the scattering
of the electrons as they pass through the thin and thick regions of the replica,
which correspond to the raised and lowered parts of the metal surface. Figure
14 shows an electron micrograph of a plastic replica from a tempered steel
consisting of particles of iron carbide (white) embedded in iron.


Fig. 12. Production of a surface replica. (Diagram labels: replica; metal; stripped replica.)


Fig. 13. Electron microscope replica on copper supporting-grid: x 35.


Fig. 14. Plastic replica from a tempered steel. Electron micrograph: x 10,000.


With plastic replicas it is impossible to reproduce detail finer than 100-200
Ångström units. The plastic material is composed of long molecular chains,
and when it is irradiated with electrons in the microscope the chains appear to
straighten and to combine with other chains, so producing much larger molecules
which limit the resolving power. These difficulties were overcome by the
development of the carbon replica method. The polished and etched metal
specimen is placed in an evacuated chamber containing two pointed carbon
rods in contact. When a current of about 50 amp is passed through the rods
for a few seconds, carbon evaporates from them and recondenses on the metal
surface to form a film about 50-100 Å thick. The film is then stripped by
chemically attacking the underlying metal, and the replicas obtained give excellent
contrast in the electron microscope; they are very stable and almost
structureless, and resolving powers of better than 25 Å may be readily
obtained. Figure 15 shows an electron micrograph of a carbon replica from a
steel consisting of ‘laths’ of iron carbide embedded in iron.


Fig. 15. Carbon replica from a steel. Electron micrograph: x 24,000.


Fig. 16. Transmission electron micrograph of an aluminium foil: x 7000. 


Fig. 17. Dislocations piled up against a grain boundary in slightly deformed stainless steel: 
x 17,500. (Courtesy of Dr. P. B. Hirsch.) 


Within the last few years techniques have been perfected, and the design of 
electron microscopes improved, enabling thin metal foils to be examined 
directly by transmission in the microscope. One such method involves rolling 
the metal to a thin foil, which is then made the anode of an electrolytic cell. 
Under carefully controlled conditions the foil will get evenly thinner by
dissolution until holes eventually appear in it. A specimen is then cut from the
edge of a hole and mounted as in fig. 13. Figure 16 shows an electron micro- 
graph of an aluminium foil taken by transmission—the grain distribution is 
similar to that in figs. 1 and 3. 

One of the more fascinating discoveries has been that it is sometimes possible
to render dislocation lines visible in the electron microscope, when examining
thin metal foils in transmission. The actual discontinuity in the crystal at a
dislocation line is only about the size of an atom, and so beyond the resolution
even of the electron microscope. The presence of this ‘ruck’ in the crystal
does, however, squeeze or stretch the surrounding part of the crystal (represented
in fig. 10 (a)), so that its effect can be ‘felt’ perhaps over 100 Å. When
the electron beam is transmitted through such material, the distortions around
the dislocation lines scatter the electrons more (or less) than the perfect parts of
the crystal, and so the positions of the dislocations appear clearly in the image.


Fig. 18. Distribution of dislocations in stainless steel deformed more heavily than in fig. 17:
x 17,000. (Courtesy of Dr. P. B. Hirsch.)

In fig. 17 a situation described in the previous section is directly observable. 
This foil, from a deformed steel specimen, shows a number of short dislocation 
lines (for they only extend from the upper to the lower surface of the foil) 
piling up at a grain boundary. A number of such ‘traffic jams’ are clearly 
visible. When more severely deformed metals are studied in this way, the 
complex tangle of dislocations is much less easy to interpret: one such work- 
hardened structure appears in fig. 18. 


6. OTHER MECHANICAL PROPERTIES OF METALS 


Many interesting mechanical properties of metals may be accounted for in
terms of dislocation movements. For example, if dislocation motion is blocked
by a hard particle inside a crystal, or at a grain boundary, such a material may
become vulnerable to crack formation through the simple mechanism illustrated
in fig. 19. A high concentration of stress collects where the dislocation is
blocked (19 b), which by the coalescence of several dislocations (19 c) may
nucleate a tiny crack. In a ductile metal this will not be a serious situation, as
plastic flow can greatly increase the force required to make a crack run. But
this benefit is dependent upon temperature: as the temperature decreases, the
tendency to flow also decreases, and there will be increasing danger of the
crack spreading.

Some metals, such as ordinary steel, are normally resistant to cracking, but
become highly susceptible to it at low temperatures. This temperature effect
caused much trouble with wartime ships: in southerly waters there was no
problem, but during the winter months in northern waters the hulls often
cracked under shocks from heavy seas. Sometimes a crack ran instantly through
the plates of a ship (fig. 20); at other times a crack remained dormant and
caused serious damage later. It need hardly be said that the study of the deformation
and fracture of metals forms an important part of current research in
physical metallurgy.


Fig. 19. Crack formation by blocking dislocations.



Two methods of eliminating any deleterious effects due to the presence of 
dislocations in metals may suggest themselves. Firstly, one may attempt to 
make them immobile through controlled precipitation of fine particles in the 
crystals by careful methods of alloying and heat-treatment. These will provide 
a series of obstacles which the dislocations must overcome; some may tend to 
anchor dislocations in position. Many strong alloy steels owe their hardness to 
this effect. Secondly, the possibility of eliminating dislocations altogether 
from the metal might be considered. So far, only tiny hair-like metal crystals 
have been prepared in a dislocation-free state. These ‘metal whiskers’, as
they are called, are tremendously strong for their size (a tin whisker having greater
relative strength than our strongest alloy steels), and it is interesting to speculate
whether large-scale material will be produced in this condition in the future.


Fig. 20. Failure at sea of an American T2 tanker.


In this survey it has only been possible to touch upon a few of the interests of
the physical metallurgist. It is a fascinating (and rapidly expanding) field of
study, and there are many opportunities for a research career in it, both in
Government laboratories (including those of the Atomic Energy Authority) and
in the private laboratories of many large industrial firms. It is also
hoped that increasing numbers of sixth-formers with A-level physics, chemistry
and mathematics will choose to read metallurgy when they enter a university.


The Author:
Dr. John Martin is a graduate in metallurgy of St. John’s College, Cambridge, where
he held a Fellowship from 1954 to 1957. He is at present a University Lecturer at Oxford,
and Goldsmiths’ Fellow in Metallurgy at St. Catherine’s College.


Time and Relativity: Part I. 


by O. R. FRISCH
Cavendish Laboratory, Cambridge 


The questions I shall discuss are not new, nor are the answers: they can
mostly be found in Einstein’s early papers (1905, 1911). But the coming of
artificial earth satellites has offered new possibilities for testing some of Einstein’s
predictions, and some tests have actually been carried out with the help of the
recoilless gamma radiation (Mössbauer 1959). Furthermore Dingle (1956,
etc.; for a review see Sherwin 1960) has cast doubt on some consequences of
Einstein’s theories, and the ensuing controversy has left many people thoroughly
bewildered. Hence a presentation of the behaviour of time according to the
theory of relativity might be useful.

Time is of course measured with clocks, and there is a variety of them. A 
grandfather clock would clearly be no use in a spaceship where gravity may be 
absent (or greatly increased as during take-off), and even a wristwatch may be 
slightly affected by acceleration. But that is no essential difficulty: an atomic 
clock—which uses the frequency associated with some atomic transition— 
would be far less sensitive to acceleration, and nuclear frequencies are less 
sensitive still (Sherwin 1960). 

To appreciate what the relativity theory has to say about time, we recall
what Newton said in his Principia: “Absolute, true, and mathematical time,
of itself and from its own nature, flows equably without relation to anything
external . . .”. That assumption has been criticized because it cannot be
tested: Newtonian time is ticking away all through space, as it were, but its
ticks cannot be heard. However, we must remember that Newtonian physics
has no speed limit; so the signal from a master clock can be sent to anywhere
in space with as little delay as you please, so that you can tell what time it is at
any place without ambiguity.

Relativity, in a limited sense, is part of Newtonian physics. A frame of
reference relative to which any free mass point moves in a straight line at constant
speed is called an inertial frame, and it was known at Newton’s time that any
frame that moves at constant speed in a straight line relative to an inertial frame
is again inertial. That fact, that the laws of mechanics are equally valid in any
two inertial frames, is sometimes called the Galilean principle of relativity. It
does not embrace the propagation of light (or other electromagnetic phenomena).
But from about 1887 (Michelson and Morley) evidence accumulated that
electromagnetism ought to be included in the principle of relativity; in particular,
light appeared to have the same speed c relative to all frames of reference.
Various attempts were made to modify mechanics accordingly; but the first
simple and consistent presentation was given by Einstein in 1905.

In this article I shall not try to present all of relativity theory, but only those 
features which relate to the question: How do moving clocks behave? The most 
striking features of that behaviour are these: 

(1) Two events, simultaneous by the clocks in one inertial frame, will in
general not be simultaneous by the clocks in another inertial frame.

(2) Of two identical clocks in relative motion, each will be observed to
go slow by an observer moving with the other.

(3) Of two identical clocks in a gravitational field, the one at higher gravitational
potential will go faster.


These features I shall derive as directly as possible from these two basic 
assumptions: (1) any two inertial frames are equivalent, and (2) light always 
travels at the same speed. In fact it is best to combine these two into one 
sentence which expresses the special relativity principle: 

Any two inertial frames are equivalent, also with regard to the propagation 
of light. 

From this one assumption we can derive the statements (1) and (2) about 
clocks. To derive (3) we also need the principle of the universal proportionality 
of weight and mass (also called the equivalence principle); this will be discussed 
in a second article. 

The decisive difference from Newtonian physics is that we now have a speed 
limit: no signal can travel faster than light. So to synchronize two clocks we 
must make do with messengers that have finite speed. In principle, all the 
subsequent arguments could be carried through if we employed boys on 
bicycles to set the clocks, but we would have to know how their speed trans- 
forms from one frame of reference to another. So it is better to use light as a 
messenger; its constant speed (in vacuo) relative to all frames of reference 
makes all arguments much simpler. 


Let us start with a simple problem: how would Jack and Mac, several 
thousand million miles apart, but both at rest in the same inertial frame S, 
synchronize their clocks? Mac might start by adjusting his clock so that it 
always reads the same time that he can see—through his telescope—on Jack’s 
clock. But soon he gets a message that his clock, as seen by Jack, is several 
hours behind; so he splits the difference by advancing his clock by half that 
amount. Now both clocks behave in the same way: each, as seen from the other 
clock, appears late by the same amount (the time the light takes from one to the 
other). Hence they can now be considered synchronized according to the 
relativity principle. 

An alternative way of synchronizing those clocks would have been to set 
them both by a master clock, placed half-way between Jack and Mac. The 
symmetry of that procedure guarantees in a transparent way the equivalence of 
the two clocks after setting, and the result is of course the same. 


Let us now place a coordinate system in S so that Jack is in its origin, and 
Mac on the positive x-axis. Next we introduce two more clocks, belonging to 
Ed and Fred who travel along the x-axis at the speed v, with Fred in front. 
Since everything happens on the x-axis we can ignore the y and z axes and 
draw the usual space-time graph, fig. 1. A point on that graph means an event, 
and we shall look at the two events O (Ed passes Jack) and P (Fred passes Mac). 
By chance O and P occur at the same time, according to the clocks in S, i.e. 
Jack’s and Mac’s. But not so according to Ed and Fred (frame S’): they will 
have synchronized their clocks by the same procedure, say, with a master 
clock that moves along, half-way between them, and a glance at fig. 1 shows that 
by their clocks the event synchronous with O is not P but Q, which is later. 
Both pairs of clocks have been synchronized by the relativity principle. If one 
pair, say Jack and Mac, is considered at rest, then of the two moving clocks the 
one in front (in space) is behind (in time). Of course the relativity principle 




Fig. 1. Relativity of Simultaneity. In this space-time diagram, each point represents an 
event. Events at the same place in the inertial system S (the ‘rest frame’) lie on 
a line parallel to the t-axis (chosen vertical) ; simultaneous events lie on a line parallel to 
the x-axis (chosen horizontal). The travel of a light signal is shown by a dotted line. 

Jack and Mac are represented by bold vertical lines since they are both at rest in 
S. M is a master clock half-way between them; a light flash emitted at F will 
reach them at O and P; for reasons of symmetry O and P must be simultaneous 
in S, confirming our choice of the horizontal direction for the x-axis. 

But in S’, the moving inertial system in which Ed and Fred are at rest, O and P 
are not simultaneous. A flash emitted at F’ from the master clock M’ (at rest in S’ 
and half-way between Ed and Fred) will reach Ed and Fred at O and Q respectively; 
so O and Q are simultaneous in S’ and Q is later than P. Fred’s clock will read 
less as he passes Mac than Ed’s clock reads as he passes Jack although Jack and Mac 
record those two events as simultaneous: ‘The clock in front is behind in time’. 

If we choose the origin of S’ (i.e. the point x’=0, t’=0) also at O, then the 
x’-axis must connect O with Q since O and Q are simultaneous in S’. It is convenient 
to choose length and time units so that the speed of light c=1; then the 
‘light lines’ are at 45° to the vertical and with the help of the two auxiliary lines 
OR and QR one sees easily that $\angle xx' = -\angle tt'$, a fact that is sometimes useful to 
remember. 


allows us to consider S’ at rest and S in motion, in the opposite direction: then 
Jack is ahead of Mac in space and indeed again behind in time. 
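
The rule that "the clock in front is behind" can be checked with a little arithmetic done wholly in the rest frame S. The sketch below is only an illustration: the function name `flash_arrival_times`, the speed 0.6 c and the separation of 10 length units are assumptions, not values from the article. It follows a flash sent from a master clock that rides half-way between Ed and Fred and reports the S-times at which the flash reaches each of them.

```python
# Sketch: moving clocks synchronized by a flash from a co-moving master clock,
# analysed in the rest frame S. Units with c = 1 unless stated otherwise.

def flash_arrival_times(v, separation, c=1.0):
    """Return the S-times at which a flash, emitted at t = 0 from the moving
    midpoint, reaches the rear clock (Ed) and the front clock (Fred).

    `separation` is the distance between Ed and Fred as measured in S.
    """
    half = separation / 2.0
    t_rear = half / (c + v)    # Ed moves towards the flash, so it arrives early
    t_front = half / (c - v)   # Fred moves away from it, so it arrives late
    return t_rear, t_front


if __name__ == "__main__":
    t_ed, t_fred = flash_arrival_times(v=0.6, separation=10.0)
    print("flash reaches Ed at S-time  ", t_ed)
    print("flash reaches Fred at S-time", t_fred)
    print("Fred is set later by        ", t_fred - t_ed)
```

Ed and Fred both set their clocks to the same reading when the flash arrives. Since Fred is set later by S-time, his clock shows less than Ed's at any one S-instant, which is the statement made in the text.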

It is surprisingly easy to get wrong results by overlooking that ‘relativity 
of synchronism’. For instance take the following arguments: the light energy 
from the flare-up of a nova is contained in an expanding spherical shell, with the 
nova at the centre; a body, flung off at great speed by the explosion, would not 
be at the centre of that shell; ‘hence relative to that body the light has not 
been spreading at the same speed in all directions’. That conclusion contradicts 
the relativity principle and must be wrong. The fallacy is that the shell we 
mentioned contains the light energy at a given time relative to the nova; if we 
ask where the light energy is at a given time relative to the moving body we shall 
again find a spherical shell, now with that body at the centre. The motion 
of the nova when it exploded is quite irrelevant; relative to any inertial frame 


the shell is centered about the point where, in that frame, the nova was when 
it exploded. 




By the way, the expanding ring around the Nova Persei of 1901 is not that 
shell. The shell as such can never be seen: the light from its different parts 
would reach us at widely different times. What we see today are those scattering 
particles by which the light can reach us with 60 years’ delay: they form an 
ellipsoidal shell with us at one focus and the place (in our inertial frame) of the 
nova explosion at the other. That light reaches us from all directions; but it 
looks weak near the centre because back-scattering is weak, and falls off again 
with increasing distance from the explosion, giving the observed pattern of a 
diffuse ring. 

Now to our second statement, concerning the behaviour of clocks in relative 
motion. Let us see how Jack and Ed would synchronize their clocks. Setting 
them is easy: as they pass each other they will set their clocks to the same time, 
say zero. But after that they will move apart and light signals will take more 
and more time from one to the other. Let us assume that Ed at first adjusts 
the rate of his clock by the receding face of Jack’s clock, ignoring the growing 
delay. His clock then reads unit time when he sees unit time on Jack’s clock; 
but Jack reports that he had to wait the longer time $(c+v)/(c-v)$ (see fig. 2) 
before he saw unit time on Ed’s clock. Once again one has to split the difference, 
or rather the ratio: Ed speeds up his clock by the factor $f=\sqrt{(c+v)/(c-v)}$ 
and resets it so that it extrapolates back to zero as before. Now the clocks are 


again equivalent: each clock keeper sees the receding clock go slower by the 
factor 1/f. 
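
The "split the ratio" step can be traced numerically. The following is a minimal sketch under stated assumptions: the function names and the sample speed 0.6 c are illustrative, and all times are rest-frame (Jack's) times with Ed moving along x = vt.

```python
import math

def jack_wait_time(t1, v, c=1.0):
    """Jack's clock reading when he sees the value t1 on Ed's naively set clock.

    Ed copies the reading t1 that he sees on Jack's receding clock; the light
    carrying Ed's new reading then has to travel back to Jack.
    """
    t_ed_sees = t1 / (1.0 - v / c)   # light leaving Jack at t1 catches Ed (x = v t)
    x_ed = v * t_ed_sees             # Ed's distance from Jack at that moment
    return t_ed_sees + x_ed / c      # return trip of the light signal


def doppler_factor(v, c=1.0):
    """Factor f by which Ed must speed up his clock (square root of the ratio)."""
    return math.sqrt((c + v) / (c - v))


if __name__ == "__main__":
    v = 0.6
    ratio = jack_wait_time(1.0, v)   # equals (c + v) / (c - v)
    print("ratio t2/t1 =", ratio)
    print("f           =", doppler_factor(v))
```

The ratio returned by `jack_wait_time` is the square of `doppler_factor`, so dividing the discrepancy equally between the two observers leaves each seeing the other's clock slow by 1/f.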


Fig. 2. Relativistic Doppler effect. Both Jack and Ed have set their clocks to zero when 
they passed each other (event O). Some time later (event D) Ed observes that 
Jack’s clock reads $t_1$ (event B) and naively he sets his clock to $t_1$ as well; but soon 
Jack reports that he had to wait until the time $t_2$ before (event C) he saw $t_1$ on Ed’s 
clock. If the units are chosen so that c=1, then AB=AC=AD=(v/c).AO; it 
follows that $t_2/t_1=OC/OB=(c+v)/(c-v)$. So Ed should have adjusted his clock 
to a speed $f=\sqrt{(c+v)/(c-v)}$ times higher than he did; when that is done the two 
clocks are equivalent in that each appears slow by the factor 1/f if viewed from the 
other. 


Again, the synchronization could have been done more simply by the use 
of a master clock halfway in between. In order to guarantee the symmetry of 
the set-up we choose our rest system so that the master clock is at rest at its 
origin, and we assume Tim and Jim with their two clocks move away from it at 
speeds +v and —v respectively. Once we have done that we can forget the 




master clock: clearly the two clocks will be synchronized if their ‘ticks’ are 
represented by equal lengths in fig. 3. When Tim looks back at Jim’s clock 
it will again read less than his own, and the ratio of the two readings is found 
(see fig. 3) to be $(c+v)/(c-v)=f^2$. That agrees with the result obtained in the 
last paragraph: Tim sees the master clock go slow by the factor 1/f and the keeper 
of the master clock sees Jim’s clock go slow by that same factor; the two factors 
must be the same because the relative velocities are the same both times. (By 
the way, the relative speed of Tim and Jim is not 2v but $2v/(1+v^2/c^2)=V$, 
and the factor f is replaced by $F=\sqrt{(c+V)/(c-V)}=(c+v)/(c-v)=f^2$.) 
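
The parenthetical remark can be verified directly. This is a small sketch, with illustrative function names and an assumed sample speed of 0.6 c: it combines the two speeds ±v into the relative speed V and checks that the Doppler factor belonging to V is the square of the one belonging to v.

```python
import math

def doppler_factor(v, c=1.0):
    """Doppler factor f = sqrt((c + v) / (c - v)) for recession speed v."""
    return math.sqrt((c + v) / (c - v))


def relative_speed(v, c=1.0):
    """Speed of Tim relative to Jim when each recedes from the master clock at v."""
    return 2.0 * v / (1.0 + (v / c) ** 2)


if __name__ == "__main__":
    v = 0.6
    V = relative_speed(v)
    print("V       =", V)                       # less than 2v, and less than c
    print("F(V)    =", doppler_factor(V))
    print("f(v)**2 =", doppler_factor(v) ** 2)
```

Doppler factors of successive observers simply multiply, which is why the composed speed V stays below c however close v comes to it.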


Fig. 3. Another derivation of the Doppler factor f. Tim and Jim are travelling away 
from Jack in opposite directions, at speed $v=c\tan\alpha$. In Newtonian physics 
Jim’s clock would appear slow to Jack by the factor EO/CO = 1/(1+v/c), whereas 
Jack’s clock appears slow to Tim by the factor CO/DO = 1−v/c; the Doppler factor 
here depends on whether the source or the receiver of the signals is at absolute 
rest. In relativity theory the two factors must be alike, but their product must be 
$(c-v)/(c+v)$ as before; hence the factor must be $\sqrt{(c-v)/(c+v)}=1/f$, as derived 
in Fig. 2. 


This of course is nothing but the Doppler effect. We have spoken of 
discrete time signals sent from one clock to the other, but it would be equally 
true for the successive crests of a long radio wave where the wave crests can 
be observed on an oscillograph; and since there is no difference in principle 
between radio waves, light or gamma rays, we conclude that our factor 1/f 
applies equally to the frequencies of all those radiations. Light from a receding 
source will be red-shifted, its wavelength increased by the factor f; if instead 
the source approaches we get a blue-shift, and by replacing v by —v in the 
expression for f we find that the wavelength is now shortened by the factor 
1/f. Any source of monochromatic radiation is a clock for our purposes 
though with light or gamma rays we cannot count the individual ticks. 

Of course the Doppler effect was known in classical physics, but there it 
depended on the absolute motion of the clocks (i.e. their motion relative to the 
ether). Figure 3 shows that the frequency of the receding clock was decreased 
by the factor 1/(1+v/c) as seen from the clock at rest whereas the other way 
round the factor was (1−v/c). The relativistic factor 1/f is seen to be the 
geometric mean between those two classical values. By the way, the classical 


formulae can still be applied, e.g. to the Doppler effect in still air, 
if the sound velocity is called ‘c’. 
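
The geometric-mean relation between the two classical factors and the relativistic one is easy to confirm. The sketch below uses illustrative function names and an assumed speed of 0.6 c; the classical expressions are the two given in the caption of fig. 3.

```python
import math

def classical_moving_source(v, c=1.0):
    """Classical frequency factor when the receding source moves and the receiver rests."""
    return 1.0 / (1.0 + v / c)


def classical_moving_receiver(v, c=1.0):
    """Classical frequency factor when the receiver recedes and the source rests."""
    return 1.0 - v / c


def relativistic_factor(v, c=1.0):
    """Relativistic frequency factor 1/f for recession at speed v."""
    return math.sqrt((c - v) / (c + v))


if __name__ == "__main__":
    v = 0.6
    product = classical_moving_source(v) * classical_moving_receiver(v)
    print("geometric mean of classical factors:", math.sqrt(product))
    print("relativistic factor 1/f            :", relativistic_factor(v))
```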




Let us now go back to relativity and ask for the time recorded by Ed’s 
clock while unit time passes in the rest system. The simplest way to do that 
is by using two clocks at rest, say, those of Jack and Mac. Let us assume they 
are the distance v apart; then if Ed passes Jack at time zero he passes Mac at 
time 1 (by Mac’s clock). Jack watches for that event and sees it when his own 
clock reads (1+v/c). Ed’s clock seems to him slow by the factor 1/f, so it reads 
$(1+v/c)/f=\sqrt{1-(v/c)^2}$ at the time it passes Mac, whose clock, as we saw, 
reads unit time. So here is our answer: judged by two synchronized clocks at 
rest, a clock moving at the speed v goes slow by the factor $\sqrt{1-(v/c)^2}$. 
That is the reciprocal of the ‘Lorentz factor’ $\gamma=(1-(v/c)^2)^{-1/2}$, the ratio of 
the relativistic mass and the rest mass of a body moving at speed v; $1/\gamma$ gives 
the ‘ Lorentz contraction’, the factor by which a rod appears shortened when 
judged from a frame in which it moves (lengthwise) at speed v. 
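
The bookkeeping of this paragraph can be followed step by step. The sketch below is illustrative only (the function name and the sample speed 0.6 c are assumptions): it computes when Jack sees Ed pass Mac, divides by the Doppler factor to get the reading he sees on Ed's clock, and compares the result with $\sqrt{1-(v/c)^2}$.

```python
import math

def ed_reading_when_passing_mac(v, c=1.0):
    """Reading of Ed's clock as he passes Mac, one rest-frame time unit after passing Jack."""
    t_pass = 1.0                               # Mac's clock reading at the event
    distance = v * t_pass                      # Jack and Mac are the distance v apart
    t_seen_by_jack = t_pass + distance / c     # the light needs v/c to reach Jack
    f = math.sqrt((c + v) / (c - v))           # Doppler factor for a receding clock
    return t_seen_by_jack / f                  # Jack sees Ed's clock slow by 1/f


if __name__ == "__main__":
    v = 0.6
    print("Ed's clock reads    :", ed_reading_when_passing_mac(v))
    print("sqrt(1 - (v/c)**2) =", math.sqrt(1.0 - v * v))
```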

This is the ‘ relativistic time dilatation’. It is not in conflict with the 
relativity principle; I have been careful to say ‘judged by two synchronized 
clocks ...’. To time a clock moving at uniform speed relative to an inertial 
frame we need two (or more) synchronized clocks in that frame. Thus we 
compare one clock with two others, a situation which is essentially unsymmetrical. 
We can of course ask about the rate of Jack’s ‘ resting ’ clock with respect to the 
two clocks of Ed and Fred; relative to them, Jack’s clock will now be losing, 
in complete agreement with the relativity principle. 


Fig. 4. Relativistic time dilatation. Let Jack and Mac be the distance v apart where v 
is Ed’s speed; then Ed, having passed Jack at zero time, reaches Mac when Mac’s 
clock reads unit time. That event is seen by Jack at the time 1+v/c (event B). 
The reading on Ed’s clock must be $(1+v/c)/f$ where $f=\sqrt{(c+v)/(c-v)}$ is the 
Doppler factor. So Ed’s clock reads $(1-v^2/c^2)^{1/2}=1/\gamma$ as the time difference 
between the two events O and A whereas that time difference is one time unit 
according to the clocks of Jack and Mac. Hence a moving clock appears slow by 
the factor $1/\gamma$ when timed on passing two synchronized clocks at rest. 


It is vague and misleading to say ‘a moving clock goes slow’. To be 
precise, one should say: ‘a clock moving at speed v relative to an inertial frame 
containing synchronized clocks is found to go slow by the factor 
$1/\gamma=\{1-(v/c)^2\}^{1/2}$ when timed by those clocks’. That statement does not contradict 
the relativity principle but indeed follows from it. 




There is a way of comparing a moving clock with just one clock at rest, if 
the moving clock changes speed and comes back to pass the other clock a 
second time. Let us say Albert with his clock accompanies Ed till they reach 
Mac; at that point Albert reverses speed and returns to Jack, taking the same 
time as for the outward journey. Jack will say to him ‘ you have been away 
two time units’, but Albert’s clock will read only $2/\gamma$. This really seems to 
mean that a moving clock goes slower than one at rest, and that would constitute 
a paradox: from the relativity principle we seem to have derived a result that 
contradicts it. But again our formulation was inaccurate: Albert’s clock has 
lost time relative to Jack’s, not because it has moved but because it has changed 
its motion on the way. So the situation once again is unsymmetrical. 




Fig. 5. An example of the ‘Twin Paradox’. Albert leaves his twin brother Jack, going 
for three years (by his clocks) at the speed 0.8 c; then he reverses speed and returns 
home after another three years. His Christmas greetings at first are delayed by 
the Doppler factor $\sqrt{(1+0.8)/(1-0.8)}=3$; but after nine years Jack sees Albert 
approach, his greetings come every four months, and at the end of the tenth year 
he is home. He in turn received only one of Jack’s greetings at the very end of his 
outward journey (one every three years, the same Doppler factor), and nine on the 
return (three a year). It all fits the expected time dilatation: $1/\gamma=\sqrt{1-(0.8)^2}=3/5$. 


It may help to discuss an example, designed to give simple numerical relations 
(Darwin 1957). Albert leaves his twin brother Jack, who is said to be at 
rest, at X-mas at a speed of 0.8 c and travels for three years, by his own clocks; 
he then reverses speed and gets home after another three of his years; so, he 




would say, he has been away for six years in all. Both Jack and Albert send 
each other regular X-mas greetings by radio. The Doppler factor is 
$\sqrt{(1+0.8)/(1-0.8)}=3$; hence Albert receives the first message only after 
three years, just as he turns back. But during the return journey he gets three 
messages a year, the last one just as he gets home. So he has received one 
message on the outward journey (at the end of it), and nine on the way back; 
hence ten years must have passed for Jack while Albert was away six of his 
years. This fits: the factor $1/\gamma=\sqrt{1-16/25}=3/5$. 

How does this exchange of messages look from Jack’s point of view? He 
receives three messages from Albert on the way out, arriving at intervals of 
three years, because of the Doppler factor. So he sees Albert receding for nine 
years. But then the signals speed up: three arrive within one year, and with 
the last one, Albert is home, after ten years, although he has sent only six annual 
messages. It all fits; both of them observe the correct Doppler slowing (or 
speeding) of the other’s clocks—through the annual messages—depending on 
whether they see the other one receding or approaching. But to Jack the 
outward journey of Albert appears nine times as long as his return trip. So the 
different times recorded by the two are fully consistent with the relativity 
principle; any other result would indicate that their clocks had not been 
synchronized according to that principle. 
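
The whole exchange of greetings can be tabulated with exact fractions. The sketch below is an illustration, not part of the original article: the names `jack_receives` and `albert_receives` are invented here, everything is computed in Jack's rest frame with c = 1 (years and light-years), and the turn-around is treated as instantaneous.

```python
from fractions import Fraction

V = Fraction(4, 5)              # Albert's speed in units of c
GAMMA = Fraction(5, 3)          # Lorentz factor for v = 0.8 c
assert GAMMA ** 2 * (1 - V ** 2) == 1

LEG = 3                         # length of each leg in Albert's own years
T_TURN = GAMMA * LEG            # Jack's time at the turn-around (5 years)
X_TURN = V * T_TURN             # distance of the turning point (4 light-years)


def jack_receives(tau):
    """Jack's clock reading when the greeting Albert sends at his own time tau arrives."""
    if tau <= LEG:                                  # sent on the outward leg
        t = GAMMA * tau
        x = V * t
    else:                                           # sent on the return leg
        t = T_TURN + GAMMA * (tau - LEG)
        x = X_TURN - V * (t - T_TURN)
    return t + x                                    # light covers x light-years in x years


def albert_receives(n):
    """Albert's clock reading when the greeting Jack sends at his time n arrives."""
    t_out = n / (1 - V)                             # meeting time if still outbound
    if t_out <= T_TURN:
        return t_out / GAMMA
    t_back = (n + X_TURN + V * T_TURN) / (1 + V)    # meeting time on the return leg
    return LEG + (t_back - T_TURN) / GAMMA


if __name__ == "__main__":
    print("Jack receives Albert's six greetings at years:")
    print([str(jack_receives(tau)) for tau in range(1, 7)])
    print("Albert receives Jack's ten greetings at his own years:")
    print([str(albert_receives(n)) for n in range(1, 11)])
```

Counting the entries reproduces the tallies in the text: Jack gets three greetings at three-year intervals and three more within the tenth year, while Albert gets one at his turning point and nine on the way home.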

The difference between their clocks is that Jack’s has been at rest in the same 
inertial system all the time while Albert’s has been at rest in two different 
inertial systems. The latter must therefore have suffered acceleration, during 
its change of speed, and it has been sometimes suggested that time is lost during 
that acceleration. But actually those accelerations can easily be kept so small 
that they would hardly affect a wristwatch, let alone an atomic clock. Further- 
more, if the overall journey was made longer the time lost would go up in pro- 
portion with the duration of the journey, while any effect of acceleration would 
remain unchanged and hence become less relevant. 

Let me point to a simple analogy. Two motorists go from A to C, Joe in a 
straight line, Bill via B. On arrival they find that Joe has travelled 60 miles 
and Bill 70. Surely this is no paradox! Admittedly Joe has travelled in a 
straight line all the time and Bill most of the time, but no one would say that 
Bill acquired his extra mileage at the corner he had to turn at B. We are all 
familiar with the fact that a broken line is longer than a straight line between 
the same two points. But most of us are not yet familiar with the fact—which 
follows from the relativity principle—that the time interval between the same 
two events is shorter when measured along a broken line (i.e. by a clock that 
changes its speed) than when it is measured along a straight line (by a clock 
travelling at constant speed). 

Sometimes the objection is made that in general relativity all frames of 
reference are equally admissible and that therefore the situation between Jack 
and Albert is not really unsymmetrical; that argument will be discussed in a 
second article. 

Some people are willing to believe that clocks behave like that, but doubt 
whether Albert would really return looking four years younger than his stay-at-home 
twin. That is indeed doubtful; he may well have aged a lot more than 
Jack, because of the discomforts of space travel! But an organism is a clock, 
though a poor one, easily affected by its surroundings; if those extraneous 




effects are eliminated or allowed for it must behave like any other clock if the 
relativity principle is valid. 

There is, however, yet another way for comparing two clocks in relative 
motion, if we bring in another co-ordinate of space. Let us say, Jack wishes 
to compare his clock with that of the distant traveller Dan, who moves at speed 
v parallel to the x-axis and at distance b away from it. If b is large enough 
Dan will spend an appreciable time near enough to the y-axis (fig. 6) so that 
the signals he sends out during that time will all suffer practically the same 
delay in reaching Jack; then Jack can just look at Dan’s clock, without any need 
for corrections. Now Dan is at rest relative to Ed, and we remember that 
Ed’s clock is slow by the factor $1/\gamma=\sqrt{1-(v/c)^2}$ if timed by Jack and Mac as it 
passes them; so we conclude that Dan’s clock must be slow by the same factor 
if timed in the same way. But actually that proviso is unnecessary: Dan’s 
signals are received by Jack and Mac essentially simultaneously, provided they 
come from a distant point near the y-axis. So the timing may be done entirely 
by Jack, with the same result: that Dan’s clock goes slow by the usual factor 
$1/\gamma$. 




Fig. 6. The ‘transverse Doppler effect’. This is not a space-time diagram. The bold 
arrow DB indicates the path of the distant traveller Dan, the two dotted lines on 
the left the paths of two light signals, sent out one time unit apart by Dan while 
close to the y-axis. They will travel the same distance (in first order); Jack will 
see them from the y-direction and receive them $1/\gamma$ time units apart, because of 
time dilatation (see the text for the detailed argument). What about time signals 
from Jack, seen by Dan from the y direction? They do not travel parallel to the 
y-direction in the rest frame but at an angle $\alpha$ where $\sin\alpha=v/c$ (see insert). The 
time interval between their arrivals at Dan’s at the points A and B is AB/v (since 
Dan travels at the speed v), which equals $1+(v/c)\cdot AB$ (taking c = 1), or $1/(1-v^2/c^2)$. This is longer 
by the expected factor $\gamma$ than the time unit on Dan’s clock; so Dan, looking along 
the y-axis, sees Jack’s clock slow by the factor $1/\gamma$ just as Jack sees Dan’s clock. 


But is this not another paradox? Here we have two clocks, each at rest in 
an inertial frame, synchronized according to the relativity principle; how then 
can one of them go slower than the other? 




Let us be precise. What Jack observed is that Dan’s clock signals, received 
from a direction perpendicular to the x-axis (their line of relative motion) are 
slow compared to his own. The relativity principle demands that the reverse 
statement, obtained by exchanging the names ‘ Jack’ and ‘ Dan’, should also 
be true. And that is indeed so. The point is that the signals that Dan sees 
arriving from a direction parallel to the y-axis do not travel in that direction 
relative to the rest frame: they travel at the angle $\alpha$ relative to the y-axis where 
$\sin\alpha=v/c$. So in the rest system the delay between two light signals in reaching 
Dan is not the same although it is (practically) the same if referred to Dan’s 
system. Let us compute that difference in delay, referring everything to the 
rest system. Let A and B be the two points at which two light signals, emitted 
one time unit apart by Jack’s clock, reach Dan. The second light signal has the 
extra distance (v/c).AB to go, so they arrive $1+(v/c)\cdot AB$ time units apart (taking c = 1), 
and in that time Dan has covered the distance AB, going at the speed v. Hence 
$1+(v/c)\cdot AB=AB/v$, or $AB/v=1/(1-(v/c)^2)$. That is the interval between 
the arrivals of the two unit ticks from Jack’s clock, and it is indeed longer than 
Dan’s time unit, by the expected factor $\gamma$; so he will report that Jack’s clock 
goes slow by that factor, as we know he must. 
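
The little equation for AB can be solved and checked numerically. The sketch below assumes units with c = 1 and a sample speed of 0.6 c; the function names are illustrative. The final division by γ, which converts the rest-frame interval into the reading of Dan's own (slow) clock, is this sketch's reading of the phrase "longer than Dan's time unit, by the expected factor γ".

```python
import math

def arrival_interval_rest_frame(v):
    """Solve 1 + v*AB = AB/v for AB (c = 1) and return the interval AB/v."""
    ab = v / (1.0 - v * v)
    # Consistency check of the equation quoted in the text.
    assert abs((1.0 + v * ab) - ab / v) < 1e-12
    return ab / v


def interval_on_dans_clock(v):
    """The same interval as recorded by Dan's clock, which runs slow by 1/gamma."""
    gamma = 1.0 / math.sqrt(1.0 - v * v)
    return arrival_interval_rest_frame(v) / gamma


if __name__ == "__main__":
    v = 0.6
    print("rest-frame interval AB/v:", arrival_interval_rest_frame(v))   # gamma squared
    print("interval by Dan's clock :", interval_on_dans_clock(v))        # gamma
```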

I hope I have made it clear that the behaviour of clocks is not really paradoxi- 
cal in the cases I have discussed. This does not prove that no real paradox, 
no logical inconsistency, can be found anywhere in the theory of relativity; 
such a proof must be mathematical and has indeed been given (Reichenbach 
1920). 

You may still ask what guarantee we have that clocks really behave in a 
manner consistent with the relativity principle. Well, the most direct proof 
comes from the study of beams of unstable particles such as mesons. Such a 
beam constitutes a clock of a kind: the number of particles diminishes along the 
beam as $\exp(-t/t_0)$ where $t_0$ is the mean life of the particles and t is their 
‘proper time’, i.e. the time as measured with a clock that moves with them. 
For instance for (charged) pions $t_0=2.5\times10^{-8}$ sec in which time a pion could 
travel 25 feet if it went at the speed of light. Without the relativistic slowing-down 
of clocks, a pion beam would have faded out after a few hundred feet of 
travel. But with big synchrotrons one can produce pions with v/c=0.9999 
where the factor $\gamma$ is about 70; intense pion beams can be obtained extending 
to many hundred feet from the target, and indeed the decrease in beam intensity 
along the beam confirms that the pions decay about 70 times more slowly—by 
laboratory time—than when they are at rest. Less clear-cut but equally 
striking examples have long been known from cosmic rays (see Janossy 1950). 
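
The figures quoted for the pion beam can be reproduced in a few lines. This is a sketch under stated assumptions: the mean life is the value given in the text, the speed of light and the foot are standard constants, and the 500-foot flight path is an arbitrary illustrative choice.

```python
import math

C_M_PER_S = 299_792_458.0     # speed of light
FOOT_M = 0.3048               # one foot in metres
MEAN_LIFE_S = 2.5e-8          # charged-pion mean life as quoted in the text


def lorentz_gamma(beta):
    """Lorentz factor for speed beta = v/c."""
    return 1.0 / math.sqrt(1.0 - beta * beta)


def mean_decay_length_ft(beta, dilated=True):
    """Mean distance travelled before decay, with or without time dilatation."""
    stretch = lorentz_gamma(beta) if dilated else 1.0
    return beta * C_M_PER_S * MEAN_LIFE_S * stretch / FOOT_M


def surviving_fraction(distance_ft, beta, dilated=True):
    """Fraction of the beam left after `distance_ft` feet of travel."""
    return math.exp(-distance_ft / mean_decay_length_ft(beta, dilated))


if __name__ == "__main__":
    beta = 0.9999
    print("gamma                      :", lorentz_gamma(beta))
    print("decay length, no dilatation:", mean_decay_length_ft(beta, dilated=False), "ft")
    print("decay length, dilated      :", mean_decay_length_ft(beta), "ft")
    print("left after 500 ft, dilated :", surviving_fraction(500.0, beta))
    print("left after 500 ft, naive   :", surviving_fraction(500.0, beta, dilated=False))
```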

A somewhat more indirect proof was obtained by Ives and Stilwell (1941). 
They accelerated hydrogen ions to about 40 kv (v/c=0.006); some of the ions 
then captured an electron and emitted spectral lines, whose Doppler shift was 
accurately measured. The transverse Doppler effect could not be observed 
directly because it would have amounted to only 0.002 per cent change in wavelength, 
which would have been masked by an uncertainty of less than 0.2° in 
the direction (relative to the particle stream) of the light observed. Instead 
they measured the Doppler shifts at two directions forming the same small 
angle $\alpha$ with the forward and the backward direction of the particle stream; the 
mean of the two shifts is $\tfrac{1}{2}\left(\sqrt{(c+v)/(c-v)}+\sqrt{(c-v)/(c+v)}\right)\cos\alpha=\gamma\cos\alpha$, 
and this value was indeed observed. 
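
The orders of magnitude in this paragraph can be checked as follows. The sketch is illustrative: the function names are invented, v/c = 0.006 is the value from the text, and the masking angle is estimated on the assumption that near 90° the first-order shift is (v/c) sin δ for a deviation δ from the perpendicular.

```python
import math

def lorentz_gamma(beta):
    """Lorentz factor for speed beta = v/c."""
    return 1.0 / math.sqrt(1.0 - beta * beta)


def transverse_shift_percent(beta):
    """Wavelength change at exactly 90 degrees, in per cent."""
    return 100.0 * (lorentz_gamma(beta) - 1.0)


def masking_angle_degrees(beta):
    """Deviation from 90 degrees whose first-order shift equals the transverse shift."""
    return math.degrees(math.asin((lorentz_gamma(beta) - 1.0) / beta))


def mean_of_forward_and_backward(beta):
    """Mean of the Doppler factors f and 1/f for the two opposite directions."""
    f = math.sqrt((1.0 + beta) / (1.0 - beta))
    return 0.5 * (f + 1.0 / f)


if __name__ == "__main__":
    beta = 0.006
    print("transverse shift (per cent):", transverse_shift_percent(beta))
    print("masking angle (degrees)    :", masking_angle_degrees(beta))
    print("mean of f and 1/f          :", mean_of_forward_and_backward(beta))
    print("gamma                      :", lorentz_gamma(beta))
```

The first-order shifts cancel in the mean of the two directions, leaving only the second-order factor γ, which is why the symmetric arrangement is insensitive to a small error in angle.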




There is also an experiment by Hay et al. (1960) who showed that gamma 
ray absorption is diminished by fast transversal motion of emitter and absorber; 
but this experiment employed rotary motion and will consequently be discussed 
in a later article. 

It may also be asked by what mechanism a clock adjusts its rate when its 
state of motion is altered. That mechanism could probably be worked out for 
each type of clock, but that would usually be a very difficult exercise, and not 
really necessary. We have no reason to doubt the accuracy of the relativity 
principle. Several of its consequences have been accurately tested. Many 
elastic collisions between fast protons and protons at rest have been seen in 
cloud and bubble chambers, and the speed and angles of the two protons 
resulting from such collisions agree with Einstein’s formulae; and in proton 
synchrotrons the magnetic field must be made to increase accurately as a function 
of the accelerating frequency, a function computed from the relativistic mass 
increase, or else the synchrotron does not work. In view of all this evidence 
it seems sensible to use the relativity principle when it offers us a quick answer 
to a question, just as we use the principle of energy and momentum conservation 
to predict—in part—the behaviour of a mechanical system without laboriously 
integrating its equations of motion. 


Fig. 7. Time dilatation illustrated by a ‘rod clock’. In 7a the bold lines represent the 
paths of the two ends of a rod of length a, travelling crosswise at the speed v. A 
light signal, travelling back and forth along the rod, traces out the dotted line, with 
$\sin\alpha=v/c$. Hence the time interval between the ticks A and B is 
$AC/(c\cos\alpha)=(2a/c)(1-v^2/c^2)^{-1/2}=2a\gamma/c$, compared to 2a/c for the same clock at rest; this is 
the usual time dilatation. 

7b is a space-time diagram, representing the same rod-clock in lengthwise 
motion, without Lorentz contraction, together with one at rest. In the latter, a 
light signal emitted at O returns at A, after a time 2a/c. In the moving clock it 
comes back at B, which is synchronous (in the rest frame) with C. The time 
interval $OC=t_1+t_2$; $t_1=OD/c=(OD-a)/v$, hence $t_1=a/(c-v)$; $t_2=BE/c=(a-BE)/v$, 
hence $t_2=a/(c+v)$. So $OC=a/(c-v)+a/(c+v)=OA/(1-v^2/c^2)=OA\cdot\gamma^2$. 
Thus the moving clock seems to be slowed by the factor $\gamma^2$ instead of $\gamma$. 
But we must remember that a rod moving lengthwise suffers Lorentz contraction 
by the factor $1/\gamma$; then the time dilatation comes out correctly. 




All the same it may be instructive to consider a very simple type of ‘ clock ’, 
namely a rod of length a with a flash lamp at each end, so constructed that each 
lamp will be set off by the flash from the other. Thus the lamps will flash 
alternately, the interval between two flashes being a/c, in the rod’s rest system. 
If the rod moves at right angles to its own direction, with the speed v, then 
the light signals—relative to the ‘ rest’ system—trace out a zigzag line, conse- 
quently the intervals between signals will be longer by a factor which can easily 
be seen (fig. 7a) to be the ‘dilatation’ factor $\gamma$. But what if the rod moves 
lengthwise? In that case the simple argument indicated in fig. 7b would seem 
to show that the clock slows down by the factor $1/\gamma^2$, not $1/\gamma$. But we have 
forgotten the Lorentz contraction: a rod moving lengthwise at the speed v 
becomes shorter by the factor $1/\gamma$, speeding up the signals to the correct rate 
(mean interval $a\gamma/c$). This is perhaps the simplest way to show that the relativity principle leads 
to the Lorentz contraction: if that contraction did not exist, the rate of the ‘ rod 
clock’ described would depend on its orientation relative to the direction of 
motion, in contradiction with the relativity principle. 
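
The rod-clock argument can be put into numbers. The sketch below is illustrative (function names, the rod length a = 1 and the speed 0.6 c are assumptions): it computes the round-trip time of the light signal in the rest frame for crosswise motion and for lengthwise motion, the latter with and without the Lorentz contraction.

```python
import math

def lorentz_gamma(v, c=1.0):
    """Lorentz factor for speed v."""
    return 1.0 / math.sqrt(1.0 - (v / c) ** 2)


def crosswise_round_trip(a, v, c=1.0):
    """Round-trip time when the rod moves at right angles to its length (zigzag path)."""
    return 2.0 * a / (c * math.sqrt(1.0 - (v / c) ** 2))


def lengthwise_round_trip(a, v, c=1.0, contracted=True):
    """Round-trip time when the rod moves along its own length."""
    length = a / lorentz_gamma(v, c) if contracted else a
    t_forward = length / (c - v)     # signal chasing the leading end
    t_back = length / (c + v)        # signal meeting the trailing end
    return t_forward + t_back


if __name__ == "__main__":
    a, v = 1.0, 0.6
    print("at rest                    :", 2.0 * a)
    print("crosswise                  :", crosswise_round_trip(a, v))
    print("lengthwise, no contraction :", lengthwise_round_trip(a, v, contracted=False))
    print("lengthwise, contracted     :", lengthwise_round_trip(a, v))
```

Only with the contraction included do the crosswise and lengthwise clocks agree, which is the point of the paragraph.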

By the way, the whole last paragraph is nothing but a re-wording of the 
traditional discussion of the Michelson-Morley experiment: the two arms of the 
interferometer can be considered as two ‘ rod clocks’, and the historic experi- 
ment established their synchronous behaviour irrespective of orientation. 

So we should accept that within the framework of special relativity, i.e. for 
clocks that for most of the time are at rest in some inertial system, with no 
large differences in gravitational potential, the terms ‘identical clocks’ and 
‘clocks synchronized by the relativity principle’ are equivalent. That is no 
longer so with accelerated clocks or in the presence of strong gravitational 
potential differences, as we shall see in the second article. 


REFERENCES 


Darwin, C. G., 1957, Nature, 180, 976. 

Dingle, H., 1956, Proc. Phys. Soc. A69, 925; 1957, Nature, 180, 1275. 

Einstein, A., 1905, Ann. Physik, 17, 891; 1911, 35, 898. 

Hay, H. J., Schiffer, J. P., Cranshaw, T. E., and Egelstaff, P. A., 1960, Phys. Rev. 
Letters, 4, 165. 

Ives, H. E., and Stilwell, G. R., 1941, J. Opt. Soc. Am., 31, 369. 

Janossy, L., Cosmic Rays, p. 22 (Oxford Press 1948 and 1950). 

Michelson, A. A., and Morley, E. W., 1887, Am. J. Science, 34, 333. 

Mössbauer, R. L., 1959, ZS. f. Naturf., 14a, 211. 

Reichenbach, H., 1920, Relativitätstheorie und Erkenntnis a priori. 

Sherwin, C. W., 1960, Phys. Rev., 120, 17. 


The Author: 

Otto Robert Frisch, O.B.E., F.R.S., is Jacksonian Professor of Natural Philosophy 
at the University of Cambridge since 1947, and Fellow of Trinity College. Educated 
in Vienna (Austria); did research in Germany, Denmark, U.K., and U.S.A., mostly on 
nuclear physics. 


Electromagnetism as a Second Order Effect 
Paper III. The Biot-Savart Law 


by W. G. V. ROSSER 


Department of Physics and Astronomy, State University of Iowa, 
Iowa City, U.S.A. 


SUMMARY 


An historical survey of the theories of the forces between current elements 
is given and Ampère’s formula is compared with the Biot-Savart law. It is 
shown that the Biot-Savart law, in differential form, is consistent with both 
Classical Electromagnetism and the Special Theory of Relativity. 


1. INTRODUCTION 


There has been a great deal of discussion of the Biot—Savart law recently, 
particularly as it is applied to ‘isolated’ current elements. The object of 
this paper is not to present a new teaching approach at school level, but rather 
to fill in some of the background to the problem for the benefit of teachers, and 
in particular to indicate how the Biot-Savart law fits in both with the ideas 
developed in Papers I and II (Rosser 1959, 1960) and with classical electro- 
magnetic theory. It must be stressed at the outset that we cannot isolate current 
elements from the rest of the circuits of which they form part in order to 
measure the forces between the elements. Neither can we determine the 
magnetic field of a moving charge accurately by experiment, since the magnetic
forces between moving charges are only ~ $v^2/c^2$ times the electric forces between
them. As in Papers I and II we shall divide the Lorentz force on a charge
$q$ e.s.u. moving with velocity $\mathbf{v}$, namely $q\mathbf{E} + (q/c)\,\mathbf{v} \times \mathbf{B}$, into an electric force
$q\mathbf{E}$ and a magnetic force $(q/c)\,\mathbf{v} \times \mathbf{B}$.

When calculating the forces between two closed circuits (denoted 1 and 2)
carrying uniform currents $I_1$ and $I_2$ e.m.u. respectively, it is usual to introduce
the intermediate step of saying that circuit 1 produces a magnetic induction $\mathbf{B}_1$,
and then to say that this magnetic induction produces the force on circuit 2. It is
generally postulated that the magnetic induction $\mathbf{B}_1$ is given by

$$\mathbf{B}_1 = I_1 \oint \frac{d\mathbf{s}_1 \times \mathbf{r}_{12}}{r_{12}^3} \qquad (1.1)$$

where $d\mathbf{s}_1$ is an element of circuit 1, and $r_{12}$ is the distance from the element to
the point where the value of the magnetic induction is required. The resultant
magnetic induction $\mathbf{B}_1$ due to circuit 1 is obtained by integrating around the
circuit. This expression is often written in differential form and assumed to
hold for current elements:

$$d\mathbf{B}_1 = I_1 \frac{d\mathbf{s}_1 \times \mathbf{r}_{12}}{r_{12}^3} \qquad (1.2)$$

The direction of $d\mathbf{B}_1$ is given by the corkscrew rule. Equation (1.1) is called
either Ampère’s law, Laplace’s law, Biot’s law or the Biot-Savart law. We
shall use the last term as it now appears to be coming into common usage.


The force on the second closed circuit carrying a steady current $I_2$ e.m.u.
is obtained by evaluating

$$\mathbf{F}_2 = I_2 \oint d\mathbf{s}_2 \times \mathbf{B}_1 \qquad (1.3)$$

This equation is an extension of the expression for the Lorentz force. The magnetic
force on any charge $q_i$ moving in the element $I_2\,d\mathbf{s}_2$ is given by $(q_i/c)\,\mathbf{v}_i \times \mathbf{B}_1$,
where $q_i$ is in e.s.u. and $\mathbf{v}_i$ is the velocity of the charge. Equation (1.3) follows
immediately by summing this last equation over all the charges in the current
element. The direction of the force is given by the left-hand motor rule. Since
the Lorentz force is confirmed by experiments in electron optics, and eqn. (1.3)
is a straightforward extension of it, then eqn. (1.3) in differential form is to a
large extent confirmed by experiment. The total force on circuit 2 is obtained
by integrating eqn. (1.3) around circuit 2.
The two eqns. (1.1) and (1.3) can be combined giving what is known as
Grassmann’s (1845) equation:

$$\mathbf{F}_2 = I_1 I_2 \oint_2 \oint_1 \frac{d\mathbf{s}_2 \times (d\mathbf{s}_1 \times \mathbf{r}_{12})}{r_{12}^3} \qquad (1.4)$$

This equation is a combination of the Biot-Savart law and the Lorentz force.


Fig. 1. We have two current elements; $I_2\,ds_2$ is at the origin and consists of a positive
charge $q_2$ at rest and labelled K, and a charge labelled L equal to $-q_2$ moving with
velocity $\mathbf{v}_2$; $I_1\,ds_1$ consists of a charge $+q_1$ at rest and labelled M and a charge $-q_1$
moving with velocity $\mathbf{v}_1$ and labelled N. The force $F_y$ shown, as calculated
using the Biot-Savart Law, is the only force.


It can be shown (c.f. Panofsky and Phillips 1955, page 113) that the forces 
between two complete circuits calculated in this way obey Newton’s Third Law 
that action and reaction should be equal and opposite. Furthermore the predic- 
tions of eqns. (1.1) and (1.4) are in agreement with results obtained using 
current balances of different geometrical configurations. However, it is well 
known that if the Biot-Savart Law is applied to ‘ isolated ’ current elements the 
forces predicted do not obey Newton’s Third Law (c.f. Tricker, R.A.R., 1955). 
For example, in fig. 1, according to the Biot-Savart Law, the current element
$I_1\,ds_1$ produces no magnetic field at the position of the element $I_2\,ds_2$, whereas
the latter current element produces a field

$$\frac{I_2\,ds_2}{x^2}$$

at the position of the first element, so that the magnetic force on $I_1\,ds_1$ is

$$\frac{I_1\,ds_1\,I_2\,ds_2}{x^2};$$

there is no magnetic force on $I_2\,ds_2$. (The currents $I_1$ and $I_2$ are expressed in
e.m.u.) At first sight this seems a little bewildering, e.g. Tricker (loc. cit.
page 44) says
“We may on the one hand, assume that the Biot-Savart formula actually gives the force
between current elements, in which case we shall be unable to apply the rules of mechanics 
to parts of circuits. It will be necessary to invent a new mechanics to deal with such cases. 
Alternatively, we may confine the application of the formula to complete circuits in which 
case ordinary mechanics will hold.” 

Before proceeding to show that the Biot-Savart law in differential form is
consistent with the theory of special relativity, when both the electric and 
magnetic forces between the electric charges in the elements are considered, 
we shall give a brief historical survey of the theories of the forces between 
current elements. 


2. HISTORICAL SURVEY 
The reader is referred to Whittaker (1951) for a fuller account of the subject. 


In 1820 Oersted discovered that a current in a wire could deflect a nearby
magnetic needle. Oersted’s results were reported to the French Academy on
September 11th, 1820 by Arago and this started a period of intense activity.
On October 30th, 1820 Biot and Savart announced that the force on a magnetic
pole placed at any distance from a straight wire could be expressed as follows:
“Draw from the pole a perpendicular to the wire: the force on the pole is at
right angles to this line and to the wire, and its intensity is proportional to the
reciprocal of the distance”. The result was analysed further by Biot (it is
probable that Laplace co-operated at this point), and Biot suggested that the
force on a magnetic pole due to a current element $i\,ds$ is proportional to
$i\,d\mathbf{s} \times \mathbf{r}/r^3$ where $r$ is the distance from $ds$ to the pole. (This suggestion of
Biot’s is quoted by Ampère 1825.)


At the meeting of the French Academy on September 18th, 1820, Ampére 
reported that two parallel wires carrying current attract each other if the currents 
are in the same direction, and repel each other if they are in opposite directions. 
During the next few years Ampére carried out a number of experiments on the 
forces between current carrying conductors and his collected results were 
published in his famous Memoir (Ampére 1825). A full account of his experi- 
ments is given by Maxwell (1881) and an excellent short account by Tricker 
(1955); the reader is referred to these for full details of the experiments. Ampére 
designed an astatic current balance to perform his experiments. He had to 
devise null experiments as the currents available at the time were not steady 
enough to enable him to perform quantitative measurements. Ampère performed
four experiments, the results of which were, to quote Whittaker (1951):

“(1) The effect of a current is reversed when the direction of the current is reversed.
(2) The effect of a current flowing in a circuit twisted into small sinuosities is the same
as if the current were smoothed out.
(3) The force exerted by a closed circuit on an element of another circuit is at right
angles to the latter.
(4) The force between two elements of circuits is unaffected when all linear dimensions
are increased proportionately, the current strengths remaining unaltered.”


These results were not sufficient for Ampére to determine the force between 
two current elements uniquely, and Ampére had to make one extra assumption; 
he assumed that the force between two current elements was along the line 
joining them. Ampère suggested that the force on a current element $I_2\,ds_2$ due
to a current element $I_1\,ds_1$ at a distance $r$ away is given by

$$d\mathbf{F} = -\mathbf{r}\,\frac{I_1 I_2}{r^3}\left\{2\,d\mathbf{s}_1 \cdot d\mathbf{s}_2 - \frac{3}{r^2}\,(d\mathbf{s}_1 \cdot \mathbf{r})(d\mathbf{s}_2 \cdot \mathbf{r})\right\} \qquad (2.1)$$

where $\mathbf{r}$ is a vector from $I_1\,ds_1$ to $I_2\,ds_2$, and $I_1$ and $I_2$ are in e.m.u. If we
interchange the roles of $I_1\,ds_1$ and $I_2\,ds_2$, the direction of the vector $\mathbf{r}$ is changed, so
that the force would be equal and opposite to that given by eqn. (2.1), hence
Newton’s Third Law is satisfied by eqn. (2.1). If eqn. (2.1) is applied to the 
current elements shown in fig. 1, then it predicts that there should be no forces 
between them. Ampére’s formula was developed by people like Weber, Gauss 
and Helmholtz to account for electromagnetic induction. A feature of these 
theories was that they tried to relate all electromagnetic phenomena directly to 
forces between charges and between current elements without introducing the 
field concept. However, mainly due to the work of Faraday and Maxwell the 
concept of the magnetic field became fashionable. As it stands, Ampére’s 
formula in differential form (eqn. 2.1) cannot be separated satisfactorily into a 
magnetic field, so that with the rise of the field concept Ampére’s original formula 
was almost completely forgotten, and very few people these days have ever heard 
of it. We shall illustrate its use and compare its properties with the Biot-Savart 
law by calculating the force between two infinitely long parallel wires as shown 
in fig. 2. We shall calculate the force on an element $I_2\,ds_2$ due to a current $I_1$
in the other wire.


Fig. 2. The force between two parallel wires carrying uniform conduction currents $I_1$
and $I_2$ e.m.u.: (a) Ampère’s method; (b) Grassmann’s method via the Biot-Savart law.


According to eqn. (2.1) the force on $I_2\,ds_2$ due to $I_1\,ds_1$ is given by

$$d\mathbf{F} = -\mathbf{r}\,\frac{I_1 I_2}{r^3}\left\{2\,d\mathbf{s}_1 \cdot d\mathbf{s}_2 - \frac{3}{r^2}\,(d\mathbf{s}_1 \cdot \mathbf{r})(d\mathbf{s}_2 \cdot \mathbf{r})\right\}$$

where now

$$d\mathbf{s}_1 \cdot d\mathbf{s}_2 = ds_1\,ds_2, \qquad d\mathbf{s}_1 \cdot \mathbf{r} = ds_1\,r\cos\theta, \qquad d\mathbf{s}_2 \cdot \mathbf{r} = ds_2\,r\cos\theta,$$

so that

$$dF = \frac{I_1 I_2\,ds_1\,ds_2}{r^2}\,(2 - 3\cos^2\theta) \qquad (2.2)$$

This force acts along the line joining the current elements. It is an attractive
force if the bracket term is positive, i.e. if $2 > 3\cos^2\theta$. It is of interest to observe
that when $3\cos^2\theta = 2$, eqn. (2.2) changes sign and for $\theta < \cos^{-1}\sqrt{2/3} = 35.3°$
the two current elements should repel. Hence, if eqn. (2.2) were correct, in
certain positions like currents would repel each other. [According to the
Biot-Savart law the magnetic field due to $I_1\,ds_1$ at the position of $I_2\,ds_2$ is
$I_1\,ds_1 \sin\theta/r^2$ and so the force on $I_2\,ds_2$ due to $I_1\,ds_1$ is equal to $I_1 I_2\,ds_1\,ds_2 \sin\theta/r^2$
in a direction perpendicular to $I_2\,ds_2$ and not along the line joining the elements.
This force is an attractive force for all values of $\theta$.]

Now according to Ampère the various current elements constituting circuit 1
would give rise to forces given by eqn. (2.2) all acting along the lines joining
the elements to $I_2\,ds_2$. By symmetry the components of these forces parallel to
wire 2 cancel and so the resultant force on $I_2\,ds_2$ should be an attractive force
equal to

$$\int dF \cos\phi = I_1 I_2\,ds_2 \int \frac{(2 - 3\cos^2\theta)\cos\phi}{r^2}\,ds_1.$$

Now

$$s_1 = a\tan\phi, \qquad ds_1 = a\sec^2\phi\,d\phi \qquad \text{and} \qquad r = a\sec\phi.$$

Hence $ds_1/r^2 = d\phi/a$ and, since $\cos\theta = \sin\phi$,

$$\text{Total force on } I_2\,ds_2 = \frac{I_1 I_2\,ds_2}{a}\int_{-\pi/2}^{+\pi/2} (2 - 3\sin^2\phi)\cos\phi\,d\phi = \frac{I_1 I_2\,ds_2}{a}\Big[2\sin\phi - \sin^3\phi\Big]_{-\pi/2}^{+\pi/2} = \frac{2 I_1 I_2\,ds_2}{a}.$$

This is an attractive force. It is perpendicular to $I_2\,ds_2$ in agreement with
Ampère’s third experiment.


Using the conventional approach via eqns. (1.2) and (1.3) we have:

$$\text{Total force on } I_2\,ds_2 = I_1 I_2\,ds_2 \int \frac{\sin\theta}{r^2}\,ds_1 = \frac{I_1 I_2\,ds_2}{a}\int_{-\pi/2}^{+\pi/2} \cos\phi\,d\phi = \frac{2 I_1 I_2\,ds_2}{a}.$$

Thus when integrated over one complete circuit, Ampère’s formula gives the
same value for the force on any current element of a second circuit as given by
the conventional method via the Biot-Savart law. Ampère proved that this
result holds in general. In the present case both approaches give the same
force per unit length of conductor, namely $2 I_1 I_2/a$ dynes/cm.
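
The agreement between the two integrals can also be checked numerically. The following is only a minimal sketch: the midpoint-rule helper and the normalisation (forces quoted in units of $I_1 I_2\,ds_2/a$) are our own assumptions for illustration and are not part of the original treatment.

```python
import math

def midpoint_integral(f, lo, hi, n=20000):
    """Midpoint-rule estimate of the integral of f from lo to hi."""
    h = (hi - lo) / n
    return h * sum(f(lo + (k + 0.5) * h) for k in range(n))

def ampere_integrand(phi):
    # (2 - 3 cos^2(theta)) cos(phi), written with cos(theta) = sin(phi)
    return (2.0 - 3.0 * math.sin(phi) ** 2) * math.cos(phi)

def biot_savart_integrand(phi):
    # sin(theta), written with sin(theta) = cos(phi)
    return math.cos(phi)

# Both totals are expressed in units of I1*I2*ds2/a (assumed normalisation).
ampere_total = midpoint_integral(ampere_integrand, -math.pi / 2, math.pi / 2)
biot_savart_total = midpoint_integral(biot_savart_integrand, -math.pi / 2, math.pi / 2)

# Angle below which Ampere's element-element force of eqn (2.2) becomes repulsive.
crossover_deg = math.degrees(math.acos(math.sqrt(2.0 / 3.0)))

print(ampere_total, biot_savart_total, crossover_deg)
```

Both totals should come out close to 2, corresponding to a force $2 I_1 I_2\,ds_2/a$, even though the two integrands differ element by element.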

The influence of Ampére’s work was very great. Writing fifty years later, 
Maxwell called Ampére the Newton of electricity and described eqn. (2.1) as 
being “perfect in form and unassailable in accuracy”. Maxwell added that
it “must always remain the cardinal formula of electrodynamics”. However,
Heaviside expressed a different opinion, saying “It has been stated on no less
authority of the great Maxwell, that Ampère’s law of force between a pair of
current elements is the cardinal formula of electrodynamics. If so, should we
not be always using it? Do we ever use it? Did Maxwell in his Treatise?”
(Heaviside 1888). The reason for neglecting Ampère’s formula (eqn. 2.1) was
that it was more convenient to calculate the magnetic field using the Biot-Savart
law (namely eqn. (1.1)), though our example has illustrated that when integrated
around one complete circuit, the two methods give the same force on any current 
element of a second circuit. It was pointed out in §1 that the Biot—Savart law 
contradicts Newton’s third law when applied to isolated current elements. 
We now proceed to show that the use of the Biot-Savart law in differential form 
is consistent with the theory of special relativity when we interpret electric 
currents in terms of the motion of electric charges, and provided we include both 
the electric and magnetic forces between the charges. 

In the 20th century, following the discovery of the electron, there have been 
rapid advances in the theories of conduction of electricity in solids. It would 
be beyond the scope of this paper to discuss these theories in detail. It will 
be sufficient for our purpose to appreciate that when the metallic atoms are 
packed closely together in a solid, some of the more loosely bound electrons 
become free enough to move under the influence of an electric field and give rise 
to conduction currents. In order to appreciate the orders of magnitude involved 
we shall consider as a ‘typical’ current element a section 1 mm long of a
copper wire of cross-sectional area 1 mm². Such an element would contain
~ $8.5 \times 10^{19}$ conduction electrons. These would be moving with a velocity
distribution given, to a good approximation, by the Fermi gas model. The
actual velocities of the electrons would be ~ $c/100$ where $c$ is the velocity of light.
For a current of 1 ampere the mean drift velocity of the conduction electrons
would be ~ $10^{-2}$ cm/sec. When the current is steady the mean acceleration
of the electrons in a straight conductor is zero, though the individual electrons
are continually accelerated and retarded, leading to Joule heating. The electric
charges of moving electrons are compensated by the presence of the positive ions 
which are virtually at rest in the conductor, so that in practice the conductor is 
electrically neutral apart from any capacity effects which we shall neglect. 
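
The orders of magnitude quoted above can be reproduced with a few lines of arithmetic. This is a rough sketch only: the electron density of copper and the electronic charge are assumed textbook values (not taken from the paper), and the charge is expressed in coulombs for convenience rather than in e.s.u.

```python
# Assumed textbook constants (not from the original paper).
ELECTRON_DENSITY_PER_CM3 = 8.5e22   # conduction electrons per cm^3 in copper
ELECTRON_CHARGE_C = 1.602e-19       # magnitude of the electronic charge, coulombs
LIGHT_SPEED_CM_PER_S = 2.998e10     # velocity of light, cm/sec

# The 'typical' current element: 1 mm long, 1 mm^2 cross-section, carrying 1 ampere.
length_cm = 0.1
area_cm2 = 0.01
current_amp = 1.0

n_electrons = ELECTRON_DENSITY_PER_CM3 * area_cm2 * length_cm
drift_cm_per_s = current_amp / (ELECTRON_DENSITY_PER_CM3 * ELECTRON_CHARGE_C * area_cm2)

print(f"conduction electrons in the element: {n_electrons:.2e}")
print(f"mean drift velocity: {drift_cm_per_s:.2e} cm/sec")
print(f"drift velocity / c: {drift_cm_per_s / LIGHT_SPEED_CM_PER_S:.1e}")
```

The drift velocity is thus many orders of magnitude smaller than both the random velocities (~ $c/100$) and $c$ itself.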

It is reasonable to try and interpret the magnetic field of such a current 
element, and the forces between two such current elements, in terms of the 
electric and magnetic fields produced by the charges moving in the elements. 
We shall discuss this problem from the viewpoint of both classical electro- 
magnetism and the formulae derived in Paper II. 

It must be stressed that the current elements are not to be thought of as 
being completely isolated from the rest of the circuits. We shall calculate the
contribution to the magnetic field due to the charges in that part of the circuit
which we consider as a typical current element, at one instant of time only. 
Under the influence of the potential difference the electrons in this element will 
‘flow’ on, but under steady conditions they will be replaced by an equal 
number of other electrons. Due to the large number of electrons, even in a 
small section of wire, statistical fluctuations can be ignored, so that, provided 
the currents are steady, the magnetic fields and the associated forces should 
remain constant. 


3. THE BIOT-SAVART LAW AND THE FORCES BETWEEN CURRENT ELEMENTS FROM
THE VIEWPOINT OF THE THEORY OF SPECIAL RELATIVITY 


We start by considering the example discussed in §1 and illustrated in fig. 1. 
It will be assumed initially that each current element consists of a positive 
charge at rest and a negative charge moving with uniform velocity. We shall 
use the formulae for the total force between two charges moving with uniform 
velocity, which were derived in Paper II by taking the transformations of the 
theory of special relativity and Coulomb’s law as axiomatic. We found that 


$$F_x = \frac{q_1 q_2}{s^3}\,(1 - v^2/c^2)\,\{x + (v/c^2)(y\,u_y + z\,u_z)\} \qquad (3.1)$$

$$F_y = \frac{q_1 q_2}{s^3}\,(1 - v^2/c^2)\,(1 - v u_x/c^2)\,y \qquad (3.2)$$

$$F_z = \frac{q_1 q_2}{s^3}\,(1 - v^2/c^2)\,(1 - v u_x/c^2)\,z \qquad (3.3)$$

where

$$s = \sqrt{x^2 + (1 - v^2/c^2)(y^2 + z^2)}. \qquad (3.4)$$

These equations represent the force on a charge $q_2$ which is moving with uniform
velocity $\mathbf{u}$ $(u_x, u_y, u_z)$ at a point $(x, y, z)$, due to a charge $q_1$ which is at the origin
and which is moving with uniform velocity $v$ along the $x$ axis. By choosing
suitable axes in each case, eqns. (3.1), (3.2) and (3.3) can be used to calculate 
the total force between any two of the charges shown in fig. 1. The results are 
shown in table 1. The combined force on K plus L is zero, whilst the resultant
force on M and N has the components

$$F_x = 0 \qquad \text{and} \qquad F_y = \frac{q_1 q_2\,v_1 v_2/c^2}{x^2\sqrt{1 - v_2^2/c^2}}.$$

In the above formulae $q_1$ and $q_2$ are in e.s.u. If the currents are expressed in
e.m.u. then we have

$$F_y = \frac{I_1\,ds_1\,I_2\,ds_2}{x^2\sqrt{1 - v_2^2/c^2}},$$

which approaches

$$F_y = \frac{I_1\,ds_1\,I_2\,ds_2}{x^2} \qquad (3.5)$$

if $v_2 \ll c$.


This is the value predicted by the Biot-Savart law. It can be seen from table 1
that the forces between any two charges differ from equality of action and
reaction by terms of order $v^2/c^2$. Both the electric and the magnetic forces
between the charges are included in table 1. The electric forces on the positive
and negative charges in a current element compensate each other, leaving only
the magnetic forces which are of order $v^2/c^2$. Deviations from Newtonian
Mechanics of the order of $v^2/c^2$ are normal in the theory of special relativity.
The new mechanics which Dr. Tricker suggests the Biot-Savart formula for
current elements would require has been in existence since 1905, namely the
theory of special relativity, but electric as well as magnetic forces must be
included.


Charges    $F_x$                                              $F_y$
M on K     $-q_1 q_2/x^2$                                     0
M on L     $+q_1 q_2/x^2$                                     0
N on K     $+(q_1 q_2/x^2)(1 - v_1^2/c^2)$                    0
N on L     $-(q_1 q_2/x^2)(1 - v_1^2/c^2)$                    0
K on M     $+q_1 q_2/x^2$                                     0
K on N     $-q_1 q_2/x^2$                                     0
L on M     $-q_1 q_2/\{x^2\sqrt{1 - v_2^2/c^2}\}$             0
L on N     $+q_1 q_2/\{x^2\sqrt{1 - v_2^2/c^2}\}$             $(q_1 q_2/x^2)\,(v_1 v_2/c^2)/\sqrt{1 - v_2^2/c^2}$


TABLE 1*


* The signs on the components refer to the directions of the axes implied in fig. 1,
e.g. $-F_x$ means a force in the negative $x$ direction.


In carrying out the calculations it has been assumed that the charges move 
with uniform velocities. This is not exactly true, as the charge carriers have a 
velocity distribution given approximately by the Fermi gas model. The charge 
carriers are also continually accelerated and decelerated, leading to Joule 
heating. We proceed now to discuss the effects of a velocity distribution and 
defer discussion of the effects of acceleration until §4. 


When there are a large number of charges present in both current elements 
it is easier to work out the magnetic field due to all the charges in one current 
element, and then apply the Lorentz force to calculate the force on the other 
element. It was shown in Paper II that eqns. (3.1), (3.2) and (3.3) can be 
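separated into electric and magnetic fields due to $q_1$ and given by

$$\mathbf{E} = \frac{q_1\,\mathbf{r}\,(1 - v^2/c^2)}{r^3\,\{1 - (v^2/c^2)\sin^2\theta\}^{3/2}} \qquad (3.6)$$

$$\mathbf{B} = \frac{q_1\,\mathbf{v} \times \mathbf{r}\,(1 - v^2/c^2)}{c\,r^3\,\{1 - (v^2/c^2)\sin^2\theta\}^{3/2}} \qquad (3.7)$$

where $\theta$ is the angle between $\mathbf{v}$ and $\mathbf{r}$. The force on $q_2$ is then given by

$$\mathbf{F} = q_2\mathbf{E} + (q_2/c)\,\mathbf{u} \times \mathbf{B} \qquad (3.8)$$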


The use of eqns. (3.6), (3.7) and (3.8) is equivalent to using eqns. (3.1), (3.2) 
and (3.3). Using eqns. (3.6), (3.7) and (3.8) we can obtain the forces given in 
table 1. 
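
As an illustration of this equivalence, the entries of table 1 can be generated directly from the fields of a uniformly moving charge. The sketch below makes several assumptions that are ours, not the paper's: units with $c = 1$, unit charges and unit separation, and a particular choice of directions (K and L at the origin with L moving along $+y$; M and N at $(x, 0, 0)$ with N moving along $+x$), since the figure itself is not reproduced here.

```python
import math

C = 1.0  # velocity of light in the (assumed) units used for this check

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1], a[2]*b[0] - a[0]*b[2], a[0]*b[1] - a[1]*b[0])

def dot(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def force(q_src, v_src, q_test, u_test, r):
    """Force on q_test (velocity u_test) at displacement r from q_src (velocity v_src)."""
    beta2 = dot(v_src, v_src) / C**2
    r2 = dot(r, r)
    vxr = cross(v_src, r)
    # (v^2/c^2) sin^2(theta) equals |v x r|^2 / (c^2 r^2)
    denom = r2**1.5 * (1.0 - dot(vxr, vxr) / (C**2 * r2))**1.5
    E = tuple(q_src * (1.0 - beta2) * ri / denom for ri in r)        # field of uniformly moving charge
    B = tuple(q_src * (1.0 - beta2) * w / (C * denom) for w in vxr)  # B = (v x E)/c
    uxB = cross(u_test, B)
    return tuple(q_test * (E[k] + uxB[k] / C) for k in range(3))     # Lorentz force

q1 = q2 = 1.0
x = 1.0
v1, v2 = 0.3, 0.4   # deliberately large so that v^2/c^2 terms are visible

# name: (charge, velocity, position); directions are assumptions, see lead-in
charges = {
    "K": (+q2, (0.0, 0.0, 0.0), (0.0, 0.0, 0.0)),
    "L": (-q2, (0.0, v2, 0.0), (0.0, 0.0, 0.0)),
    "M": (+q1, (0.0, 0.0, 0.0), (x, 0.0, 0.0)),
    "N": (-q1, (v1, 0.0, 0.0), (x, 0.0, 0.0)),
}

def pair_force(src, tgt):
    qs, vs, ps = charges[src]
    qt, vt, pt = charges[tgt]
    r = tuple(pt[k] - ps[k] for k in range(3))
    return force(qs, vs, qt, vt, r)

pairs = [("M", "K"), ("M", "L"), ("N", "K"), ("N", "L"),
         ("K", "M"), ("K", "N"), ("L", "M"), ("L", "N")]
table = {p: pair_force(*p) for p in pairs}

on_KL = tuple(sum(table[p][k] for p in pairs if p[1] in "KL") for k in range(3))
on_MN = tuple(sum(table[p][k] for p in pairs if p[1] in "MN") for k in range(3))

for p in pairs:
    print(p[0], "on", p[1], table[p])
print("resultant on K+L:", on_KL)
print("resultant on M+N:", on_MN)
```

With these assumptions the resultant on K plus L vanishes, while the resultant on M plus N has only a $y$ component, equal to $q_1 q_2 v_1 v_2/\{c^2 x^2 \sqrt{1 - v_2^2/c^2}\}$.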

Fig. 3. Model of a current element in which the conduction electrons have a velocity
distribution.


Consider the current element $i\,ds$ shown in fig. 3. Without loss of generality
we can choose the direction of the axes such that the element lies along the $x$ axis.
It will be assumed that the current element contains an equal number of positive
ions at rest, and electrons moving with different velocities $\mathbf{v}_i$ in different
directions. Since the current is in the positive $x$ direction, summing over the
velocities of individual electrons in the current element $i\,ds$ we have

$$\sum_{i=1}^{n} (v_i)_y = \sum_{i=1}^{n} (v_i)_z = 0 \qquad (3.9)$$

and

$$\sum_{i=1}^{n} (v_i)_x = -n\bar{v} \qquad (3.10)$$

where $\bar{v}$ is the mean drift velocity of the conduction electrons and $n$ is the total
number of conduction electrons in the current element. The total current
flowing in the current element is given by

$$i = qN\bar{v}$$

where $N$ is the total number of conduction electrons per centimetre length of
the conductor. Since the length of the current element is $ds$ we have

$$n = N\,ds.$$

Hence

$$(-q)\sum_{i=1}^{n} (v_i)_x = (-q)\,n\,(-\bar{v}) = i\,ds \qquad (3.11)$$

or

$$-q\sum_{i=1}^{n} \mathbf{v}_i = i\,d\mathbf{s}. \qquad (3.12)$$
If one of the electrons in the current element $i\,ds$ moves with velocity $\mathbf{v}_i$ then
from eqn. (3.7) its contribution to the magnetic field at P is equal to

$$\mathbf{B}_i = \frac{-q\,\mathbf{v}_i \times \mathbf{r}\,(1 - v_i^2/c^2)}{c\,r^3\,\{1 - (v_i^2/c^2)\sin^2(\mathbf{v}_i, \mathbf{r})\}^{3/2}}.$$

Now, since the velocity of individual electrons is ~ $c/100$ we can neglect terms
of order $v_i^2/c^2$. Hence

$$\mathbf{B}_i \approx -\frac{q\,\mathbf{v}_i \times \mathbf{r}}{c\,r^3}. \qquad (3.13)$$

Writing this out in components we have

$$(B_i)_x = -\frac{q}{c\,r^3}\,\big[(v_i)_y\,z - (v_i)_z\,y\big] \quad \text{etc.}$$


If we neglect the variations of r over the dimensions of the current element 
the total magnetic field at P becomes 


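$$\sum_{i=1}^{n} (B_i)_x = -\frac{q}{c\,r^3}\left[z\sum_{i=1}^{n} (v_i)_y - y\sum_{i=1}^{n} (v_i)_z\right]$$

where we are summing over all the electrons in the element $i\,ds$. From eqn. (3.9)
$\sum (v_i)_y$ and $\sum (v_i)_z$ are both zero. Hence

$$\sum_{i=1}^{n} (B_i)_x = 0. \qquad (3.14)$$

Similarly

$$\sum_{i=1}^{n} (B_i)_y = -\frac{q}{c\,r^3}\sum_{i=1}^{n} \big[(v_i)_z\,x - (v_i)_x\,z\big] = +\frac{q\,z}{c\,r^3}\sum_{i=1}^{n} (v_i)_x.$$

Using eqn. (3.11) we obtain

$$\sum_{i=1}^{n} (B_i)_y = -\frac{1}{c\,r^3}\,i\,ds\,z. \qquad (3.15)$$

Similarly

$$\sum_{i=1}^{n} (B_i)_z = -\frac{q}{c\,r^3}\sum_{i=1}^{n} \big[(v_i)_x\,y - (v_i)_y\,x\big] = \frac{1}{c\,r^3}\,i\,ds\,y. \qquad (3.16)$$

Since the current element lies in the $x$ direction, $d\mathbf{s}$ has components $dx$, $dy = 0$,
$dz = 0$. Equations (3.14), (3.15) and (3.16) can be written in the form

$$d\mathbf{B} = \frac{i}{c\,r^3}\,d\mathbf{s} \times \mathbf{r} \quad \text{if } i \text{ is in e.s.u.,} \qquad (3.17)$$

$$d\mathbf{B} = \frac{I}{r^3}\,d\mathbf{s} \times \mathbf{r} \quad \text{if } I \text{ is in e.m.u.,} \qquad (3.18)$$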


where $d\mathbf{B}$ is the resultant magnetic field due to the element $i\,ds$. This is the
formula for the Biot-Savart law in differential form (eqn. (1.2)). The formula
used for calculating the magnetic field was derived from Coulomb’s law using
the transformations of the theory of special relativity. Hence the use of eqn.
(1.2) is consistent with the theory of special relativity. The force on a second
current element $i'\,ds'$ at the point P $(x, y, z)$ can be obtained by summing the
magnetic force over all the moving charges in the second current element, where $\mathbf{u}_i$
is the velocity of one of the charges in the second current element. Since the moving
charges are electrons of charge $-q$, the resultant magnetic force on $i'\,ds'$ is equal to

$$d\mathbf{F} = \sum_i \left(-\frac{q}{c}\right)\mathbf{u}_i \times d\mathbf{B} = +\frac{q}{c}\,d\mathbf{B} \times \sum_i \mathbf{u}_i. \qquad (3.19)$$

By analogy with eqn. (3.12) we have

$$-q\sum_i \mathbf{u}_i = i'\,d\mathbf{s}'.$$

Hence, eqn. (3.19) becomes

$$d\mathbf{F} = -d\mathbf{B} \times \frac{i'\,d\mathbf{s}'}{c} = \frac{i'}{c}\,d\mathbf{s}' \times d\mathbf{B}.$$

If $I'$ is in e.m.u. then using eqn. (3.18) we have

$$d\mathbf{F} = I'\,d\mathbf{s}' \times d\mathbf{B} = \frac{I I'}{r^3}\,d\mathbf{s}' \times (d\mathbf{s} \times \mathbf{r}). \qquad (3.20)$$


This is in agreement with eqn. (1.4) in differential form. 
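
The size of the neglected $v_i^2/c^2$ terms can be estimated numerically. The following is a minimal sketch under stated assumptions of our own: units with $c = 1$ and unit electronic charge, random velocities of fixed magnitude $c/100$ generated in $\pm$ pairs (so that the random parts sum to zero, as in eqn (3.9)), an artificially large drift velocity of $10^{-4}c$ so that rounding errors stay negligible, and an arbitrary field point. It is not a model of a real conductor.

```python
import math
import random

C = 1.0   # velocity of light (assumed units)
Q = 1.0   # magnitude of the electronic charge (assumed units)

def cross(a, b):
    return (a[1]*b[2] - a[2]*b[1], a[2]*b[0] - a[0]*b[2], a[0]*b[1] - a[1]*b[0])

def dot(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def vsum(vectors):
    vectors = list(vectors)
    return tuple(sum(v[k] for v in vectors) for k in range(3))

def field_exact(v, r):
    """Field of one electron (charge -Q) moving uniformly, v^2/c^2 terms kept."""
    beta2 = dot(v, v) / C**2
    r2 = dot(r, r)
    vxr = cross(v, r)
    k = 1.0 - dot(vxr, vxr) / (C**2 * r2)
    f = -Q * (1.0 - beta2) / (C * r2**1.5 * k**1.5)
    return tuple(f * w for w in vxr)

def field_approx(v, r):
    """Same field with terms of order v^2/c^2 neglected."""
    f = -Q / (C * dot(r, r)**1.5)
    return tuple(f * w for w in cross(v, r))

random.seed(1)
drift = (-1e-4, 0.0, 0.0)   # electrons drift along -x, so the current is along +x
speed = 1e-2                # random speed ~ c/100
velocities = []
for _ in range(2000):
    cz = random.uniform(-1.0, 1.0)
    az = random.uniform(0.0, 2.0 * math.pi)
    s = math.sqrt(1.0 - cz * cz)
    t = (speed * s * math.cos(az), speed * s * math.sin(az), speed * cz)
    for sign in (+1.0, -1.0):
        velocities.append(tuple(drift[k] + sign * t[k] for k in range(3)))

r = (0.3, 0.4, 1.2)   # arbitrary field point P

ids = tuple(-Q * comp for comp in vsum(velocities))        # -q * sum(v_i) = i ds
biot_savart = tuple(w / (C * dot(r, r)**1.5) for w in cross(ids, r))
total_approx = vsum(field_approx(v, r) for v in velocities)
total_exact = vsum(field_exact(v, r) for v in velocities)

def rel_diff(a, b):
    d = tuple(a[k] - b[k] for k in range(3))
    return math.sqrt(dot(d, d)) / math.sqrt(dot(b, b))

rel_approx = rel_diff(total_approx, biot_savart)
rel_exact = rel_diff(total_exact, biot_savart)
print("approximate sum vs Biot-Savart:", rel_approx)
print("exact sum vs Biot-Savart:", rel_exact)
```

The approximate sum reproduces the Biot-Savart expression to rounding accuracy, while keeping the $v_i^2/c^2$ terms changes the result only by a relative amount of the order of $v_i^2/c^2$.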


So far the effects of the accelerations of the conduction electrons have been 
neglected. It is convenient to calculate the magnetic field of an accelerated 
charge using classical electromagnetic theory. We shall show that, according 
to this theory, the effects of accelerations can be neglected for steady conduction 
currents in thin conductors, which are the conditions for the applicability of the 
Biot—Savart law. 


4. THE BIOT-SAVART LAW FROM THE VIEWPOINT OF CLASSICAL THEORY


We shall start our review from Maxwell’s equations in differential form, 
which in Gaussian units are 


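$$\mathrm{curl}\,\mathbf{E} = -\frac{1}{c}\frac{\partial \mathbf{B}}{\partial t} \qquad (4.1)$$

$$\mathrm{curl}\,\mathbf{H} = \frac{1}{c}\frac{\partial \mathbf{D}}{\partial t} + \frac{4\pi\mathbf{j}}{c} \qquad (4.2)$$

$$\mathrm{div}\,\mathbf{D} = 4\pi\rho \qquad (4.3)$$

$$\mathrm{div}\,\mathbf{B} = 0. \qquad (4.4)$$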


It should be noted that the field quantities, as well as the co-ordinates and 
time, refer to one point of space at one instant of time. Thus, for example using 
eqns. (4.1) to (4.4) we can discuss the behaviour of electromagnetic waves 
without relating the waves to the current and charge distributions which 
originally produced the waves. We shall proceed first to show that the Biot— 
Savart formula for the magnetic field of a charge moving with uniform velocity 
is consistent with Maxwell’s equations in differential form. 


Fig. 4. Calculation of the magnetic field of a charge moving with uniform velocity using
Maxwell’s equations.


Consider a ‘ point’ charge g moving with uniform velocity v in a straight 
line as shown in fig. 4. Let it be at the position 2 at a time t as shown in fig. 4a. 
Since div B is zero the lines of magnetic induction can neither start nor end. 
Since we have cylindrical symmetry about the direction of motion, it is reasonable 
to assume that in empty space, the lines of B and H are in the form of circles 
with their centres on the line along which the charge is moving. By symmetry 
the intensity of the magnetic field is uniform along one of these circles. Integrating
eqn. (4.2) over the area enclosed by a circle of radius $a$ as shown in fig. 4 (a)
we have

$$\int \mathrm{curl}\,\mathbf{H} \cdot d\mathbf{S} = \frac{1}{c}\frac{\partial}{\partial t}\int \mathbf{D} \cdot d\mathbf{S} + \frac{4\pi}{c}\int \mathbf{j} \cdot d\mathbf{S} \qquad (4.5)$$

where $d\mathbf{S}$ is an element of area. Now since no conduction current flows through
the area we have $\int \mathbf{j} \cdot d\mathbf{S} = 0$. Using Stokes’ theorem eqn. (4.5) becomes

$$\oint \mathbf{H} \cdot d\mathbf{l} = \frac{1}{c}\frac{\partial}{\partial t}\int \mathbf{D} \cdot d\mathbf{S}$$

or

$$H = \frac{1}{2\pi a c}\frac{\partial}{\partial t}\int \mathbf{D} \cdot d\mathbf{S}. \qquad (4.6)$$

The integral on the right-hand side of eqn. (4.6) represents the flux of $\mathbf{D}$ through
the circular area 2 at a time $t$. Now consider an earlier time $t - \delta t$. At this
instant a charge moving with uniform velocity would be at the position 1 as
shown in fig. 4 (b). Part of the flux going through circle 2 at a time $t$ went
through the cylindrical area $2\pi a\,v\,\delta t$ at a time $t - \delta t$. Hence if the velocity of
the charge is uniform in a time interval $\delta t$, the increase in flux through area 2
in a time $\delta t$ is equal to $2\pi a\,v\,\delta t\,D_\perp$, where $D_\perp$ is the component of the electric
displacement at the point P in a direction perpendicular to the direction of
motion of the charge. Hence for the circle of radius $a$ at position 2 we have

$$\frac{\partial}{\partial t}\int \mathbf{D} \cdot d\mathbf{S} = 2\pi a\,v\,D_\perp. \qquad (4.7)$$

Substituting in eqn. (4.6) we have

$$H = (v/c)\,D_\perp. \qquad (4.8)$$


Now if we neglect $\partial\mathbf{B}/\partial t$, eqn. (4.1) becomes curl $\mathbf{E} = 0$, and from eqn. (4.3) it
then follows that we can apply Coulomb’s law to calculate the electric field at P.
Hence we have numerically

$$E = D = q/r^2 \qquad \text{and} \qquad D_\perp = (q/r^2)\sin\theta,$$

where $\theta$ is the angle between $\mathbf{v}$ and $\mathbf{r}$. Substituting in eqn. (4.8) we find

$$H = \frac{q v}{c\,r^2}\sin\theta. \qquad (4.9)$$

This is in agreement with the Biot-Savart law. The effect of including the
term in $\partial\mathbf{B}/\partial t$ would be to produce changes ~ $v^2/c^2$ in $\mathbf{E}$ and $\mathbf{D}$. Hence the
use of the Biot-Savart law for the magnetic field of a charge moving with uniform
velocity is consistent with Maxwell’s equations.
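
The flux argument can be checked by finite differences. In this sketch (the numerical values, the units with $c = 1$ and the closed-form flux of a Coulomb field through a coaxial disc are our own assumptions for illustration) the rate of change of the flux through a disc of radius $a$ is computed as the charge approaches it, and the resulting $H$ from eqn. (4.6) is compared with $(qv/cr^2)\sin\theta$.

```python
import math

def coulomb_flux(q, a, x):
    """Flux of D = q r / r^3 through a disc of radius a centred on the line of
    motion, a distance x > 0 ahead of the charge (q times the solid angle)."""
    return 2.0 * math.pi * q * (1.0 - x / math.sqrt(x * x + a * a))

q, v, c = 1.0, 0.01, 1.0   # assumed values; v << c so Coulomb's law is adequate
a, x0 = 2.0, 3.0           # disc radius and present axial distance of the charge
dt = 1e-3

# The charge approaches the disc, so its axial distance at time t is x0 - v*t.
flux_rate = (coulomb_flux(q, a, x0 - v * dt) - coulomb_flux(q, a, x0 + v * dt)) / (2.0 * dt)

H_from_flux = flux_rate / (2.0 * math.pi * a * c)   # field obtained from the flux rate

r = math.sqrt(x0 * x0 + a * a)
sin_theta = a / r
H_biot_savart = q * v * sin_theta / (c * r * r)     # (q v / c r^2) sin(theta)

print(H_from_flux, H_biot_savart)
```

The two values should agree to within the accuracy of the finite-difference derivative.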


Using Maxwell’s equations, it is only possible to relate the electric and 
magnetic fields of moving charges to the motion of the charges in a few simple 
cases, and then only by round about methods of the type we have just used. To 
proceed to the general case of an accelerated charge it is convenient to introduce 
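the vector and scalar potentials $\mathbf{A}$ and $\Phi$ defined by the relations

$$\mathbf{E} = -\mathrm{grad}\,\Phi - \frac{1}{c}\frac{\partial \mathbf{A}}{\partial t} \qquad (4.10)$$

$$\mathbf{B} = \mathrm{curl}\,\mathbf{A} \qquad (4.11)$$

subject to the Lorentz condition, which in empty space is

$$\mathrm{div}\,\mathbf{A} + \frac{1}{c}\frac{\partial \Phi}{\partial t} = 0. \qquad (4.12)$$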


We shall only outline the procedure. The reader will find an excellent account 
of the subject matter of the remainder of this section in Panofsky and Phillips 
(1955, Chapters 13, 18 and 19). Using eqns. (4.10), (4.11) and (4.12), together 
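with Maxwell’s equations we obtain for empty space, where $\mu = k = 1$, the
following equations

$$\nabla^2 \mathbf{A} - \frac{1}{c^2}\frac{\partial^2 \mathbf{A}}{\partial t^2} = -4\pi\mathbf{j}/c \qquad (4.13)$$

$$\nabla^2 \Phi - \frac{1}{c^2}\frac{\partial^2 \Phi}{\partial t^2} = -4\pi\rho. \qquad (4.14)$$

The solutions of these equations are

$$\Phi(x, y, z, t) = \int \frac{\rho(x', y', z', t - r/c)}{r}\,dx'\,dy'\,dz' \qquad (4.15)$$

and

$$\mathbf{A}(x, y, z, t) = \frac{1}{c}\int \frac{\mathbf{j}(x', y', z', t - r/c)}{r}\,dx'\,dy'\,dz' \qquad (4.16)$$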


where $x$, $y$ and $z$ refer to the point where we want the potentials at a time $t$.
The charge and current densities at the point $x'$, $y'$, $z'$ at a distance $r$ from the
field point relate to the values at a time $r/c$ earlier. Hence, according to eqns.
(4.15) and (4.16) the contributions of $\rho(x', y', z')$ and $\mathbf{j}(x', y', z')$ to the potentials
at the point $x$, $y$, $z$ are propagated from the sources with velocity $c$. For this
reason the potentials $\mathbf{A}$ and $\Phi$ are generally called the retarded potentials.
With a moving charge the potentials at a distance $r$ from it must be related to
the velocity and acceleration of the charge at an earlier time, namely at what is
called its retarded position. This is illustrated in fig. 5. In order to obtain the
field of a moving ‘point’ charge such as an electron, it is normal when calculating
$\Phi$ and $\mathbf{A}$ to allow for the retardation effects associated with various parts of
the ‘point’ charge, but subject to the condition that the total charge of the
electron is constant at all times. In this context by a ‘point’ charge one means
a charge whose dimensions are finite, but are very much smaller than $r$. One
then obtains the Liénard-Wiechert potentials, viz.:

$$\Phi = \frac{q}{[r - \mathbf{r} \cdot \mathbf{v}/c]} \qquad (4.17)$$

and

$$\mathbf{A} = \frac{q\,\mathbf{v}}{c\,[r - \mathbf{r} \cdot \mathbf{v}/c]} \qquad (4.18)$$

where the symbols are illustrated in fig. 5. The quantities inside the brackets
refer to values at the retarded position.


Fig. 5. The vector r is the distance from the retarded position to the point of observation;
v is the velocity of the charge and w its acceleration at the retarded position, that is at a
time t − r/c. [Labels in the figure: present position of the charge at a time t if
accelerations are present; present position of the charge at a time t if it had travelled
with uniform velocity v; observation point at a time t.]


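Using eqns. (4.10) and (4.11) we obtain from eqns. (4.17) and (4.18) the formulae
for the electric and magnetic fields of an accelerated charge, in terms of the
velocity and acceleration of the charge in the retarded position:

$$\mathbf{E} = \frac{q}{s^3}\left(\mathbf{r} - \frac{r\mathbf{v}}{c}\right)\left(1 - \frac{v^2}{c^2}\right) + \frac{q}{c^2 s^3}\,\mathbf{r} \times \left[\left(\mathbf{r} - \frac{r\mathbf{v}}{c}\right) \times \dot{\mathbf{v}}\right] \qquad (4.19)$$

and

$$\mathbf{B} = \frac{q\,\mathbf{v} \times \mathbf{r}}{c\,s^3}\left(1 - \frac{v^2}{c^2}\right) + \frac{q}{c^2 s^3 r}\,\mathbf{r} \times \left\{\mathbf{r} \times \left[\left(\mathbf{r} - \frac{r\mathbf{v}}{c}\right) \times \dot{\mathbf{v}}\right]\right\} \qquad (4.20)$$

where $s$ is now given by the eqn.: $s = r - \mathbf{r} \cdot \mathbf{v}/c$, and $\dot{\mathbf{v}}$ is the acceleration of the
charge (denoted w in fig. 5). The vectors $\mathbf{v}$ and $\mathbf{r}$ in eqns.
(4.19) and (4.20) refer to the values at the retarded position.


Now the first term in eqn. (4.20) does not contain the acceleration, and is
the same as eqn. (3.7), and is the formula we have used in §3 for the magnetic
field of a charge moving with uniform velocity. If $v \ll c$ this term reduces to the
Biot-Savart law. We now proceed to discuss the order of magnitude of the
second term, which is the radiation term. If $v \ll c$ we can put $s \approx r$ and $\mathbf{r} - r\mathbf{v}/c \approx \mathbf{r}$.
We then have

$$\mathbf{B}_{\mathrm{rad}} = \frac{q}{c^2 r^2}\,\dot{\mathbf{v}} \times \mathbf{r}.$$

Notice if $\dot{\mathbf{v}}$ changes direction so does $\mathbf{B}_{\mathrm{rad}}$. Consider the straight current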
element shown in fig. 3. When the current in the whole circuit is steady, then 
the current element is in a state of dynamic equilibrium, and the velocity distri- 
bution of the electrons leaving the element is the same as the velocity distribution 
of the electrons entering the element. There are accelerations due to the 
potential difference and retardations due to collisions, but the average accelerations 
of the conduction electrons is zero once the current reaches a steady value. Hence 
the radiation field due to the accelerating electrons is compensated by the 
radiation field in the opposite direction due to electrons suffering retardations. 
Similarly, the accelerations due to the collisions with the boundary of the con- 
ductor are opposite in direction at opposite sides of the conductor, and for thin 
conductors their radiation fields should compensate. There are also centripetal 
accelerations in curved conductors since the electrons flow along curved wires.
In order to get some idea of the order of magnitude of this effect it will be
assumed that the electrons move with uniform velocity v (the mean drift velocity),
and have centripetal accelerations v²/ρ, where ρ is the radius of curvature of
the current element. The radiation field of one such electron of charge −q at
a distance r from it would be

$$B_{\mathrm{rad}} \approx \frac{q v^2}{c^2 r \rho} \qquad (4.21)$$

whilst the magnetic field given by the Biot-Savart law would be

$$\mathbf{B}_{\mathrm{B\text{-}S}} = \frac{q\,\mathbf{v}\times\mathbf{r}}{c r^3}.$$

Neglecting geometrical factors we have

$$\frac{B_{\mathrm{rad}}}{B_{\mathrm{B\text{-}S}}} \approx \frac{v r}{c \rho}.$$

Since v ≪ c we can neglect B_rad compared with B_B-S.
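
To put a figure on this ratio, the short Python sketch below evaluates vr/cρ
for one illustrative case. The drift velocity, distance and radius of curvature
are assumed values chosen only for illustration (they are not taken from the
text), and the function name is hypothetical.

```python
# Order-of-magnitude estimate of B_rad / B_B-S ~ (v / c) * (r / rho),
# neglecting geometrical factors. c.g.s. units throughout.
C_CM_PER_SEC = 3.0e10  # velocity of light in vacuo, cm/sec


def radiation_to_biot_savart_ratio(v, r, rho, c=C_CM_PER_SEC):
    """Return (v / c) * (r / rho) for drift velocity v, distance r, radius rho."""
    if rho <= 0.0:
        raise ValueError("radius of curvature must be positive")
    return (v / c) * (r / rho)


if __name__ == "__main__":
    drift_velocity = 1.0e-2  # cm/sec, ASSUMED typical drift velocity
    distance = 10.0          # cm, ASSUMED distance of the field point
    radius = 1.0             # cm, ASSUMED radius of curvature of the wire
    ratio = radiation_to_biot_savart_ratio(drift_velocity, distance, radius)
    print(f"B_rad / B_B-S is of order {ratio:.1e}")
```

With any plausible choice of inputs the ratio remains many orders of magnitude
below unity, because the drift velocity is so small compared with c.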

We have seen that, according to classical electromagnetic theory, when the 
current in a current element is steady then we can neglect the effects of the 
accelerations of the electrons when calculating the magnetic field due to the 
current element. The Biot-Savart formula is then a very good approximation. 
Naturally when the current is varying the radiation field becomes more
important. Radiation effects can normally be neglected in ordinary a.c.
theory, but at very high frequencies, when v̇ may be large, the radiation field
becomes important.

In the theory of special relativity deviations from Newton’s Laws of Motion
~v²/c² are accepted as normal. How then were the deviations from Newton’s
Third law, predicted by the Biot-Savart law, reconciled with classical
electromagnetic theory? In classical electromagnetic theory it was assumed that there
was energy equal to (E.D + H.B)/8π ergs/cm³ in the electromagnetic field.
This viewpoint is used in deriving the Poynting vector (c/4π) E × H, which is
taken to represent the rate of energy flow in the electromagnetic field. If one
associates a linear electromagnetic momentum density of (1/4πc) E × H with the
electromagnetic field, it can be shown that the sum of the momentum of the
matter and of the momentum of the electromagnetic field in an isolated system
of moving charges is constant with respect to time, though the law of action and
reaction is no longer true for matter considered on its own (cf. Abraham &
Becker 1932). The special case of current elements was treated in detail by
Page & Adams (1945).

We illustrate the interpretation qualitatively with reference to fig. 3 of
Paper II. Consider the charge q₁ moving to the right giving rise to fields E₁
and H₁ at a point in empty space. Consider also the charge q₂ moving to the
left and producing fields E₂ and H₂ at the same point. The electromagnetic
linear momentum per unit volume is

$$\frac{1}{4\pi c}\left(\mathbf{E}_1\times\mathbf{H}_1 + \mathbf{E}_1\times\mathbf{H}_2 + \mathbf{E}_2\times\mathbf{H}_1 + \mathbf{E}_2\times\mathbf{H}_2\right) \qquad (4.24)$$

The two terms (1/4πc) E₁ × H₁ and (1/4πc) E₂ × H₂ remain constant if the velocities
of the charges remain constant, and are responsible for the electromagnetic
masses of the particles when they are accelerated. The other terms vary; e.g.
we can visualise that when q₁ and q₂ move as illustrated in fig. 3 of Paper II,
the magnitudes of E₁ and H₂ vary at all points in space and (1/4πc) ∫ E₁ × H₂ dV
integrated over all space varies with time. Similarly (1/4πc) ∫ E₂ × H₁ dV varies
with time. Page & Adams show that

$$\frac{d}{dt}\left[\frac{1}{4\pi c}\int\left(\mathbf{E}_1\times\mathbf{H}_2 + \mathbf{E}_2\times\mathbf{H}_1\right)dV\right]$$

balances the difference between the forces between the charges, illustrating how,
according to classical electromagnetic theory, the sum of the linear momenta of
matter plus electromagnetic field is constant. (The interested reader is referred
to Page & Adams for details of the calculation.)


Since the linear momentum of the electromagnetic field varies with the 
configuration and velocities of the charges, it follows from classical electro- 
magnetic theory that the total linear momentum of the matter in an isolated 
system considered on its own should vary. At one time it was postulated that 
changes in the electromagnetic linear momentum gave rise to forces on the 
ether, which then transmitted these forces. According to Lorentz’s electron 
theory the ether does not transmit forces. If the only forces are the forces 
acting on matter, and if these are given by the Lorentz force, then Newton’s 
Third law does not hold in Lorentz’s electron theory. On this point Lorentz 
wrote: 

“but as far as I see, nothing compels us to elevate that proposition to a
fundamental law of unlimited validity”
According to the theory of special relativity the only observable forces are the
forces acting on matter. Deviations ~v²/c² from Newton’s Third law are
accepted as normal for the forces between separated charges.


5. DISCUSSION 


As experiments cannot be carried out to determine the force between
isolated current elements, there is still some controversy as to the exact nature
of the force between current elements. We have shown that the Biot-Savart
formula for the magnetic field due to a current element or a moving charge,
together with the Lorentz force, is consistent with both the theory of special
relativity and classical electromagnetic theory. However, no theory is final, and
we must admit the possibility that special relativity may be replaced one day.
There are some who claim that Maxwell and Einstein may have led us up a
blind alley and they try to base their electromagnetism on Ampère’s formula
(eqn. (2.1)). It is claimed by some (e.g. Robertson 1945, where references to
other works will be found) that Ampère’s formula gives the forces on movable
parts of circuits more accurately than Grassmann’s formula (eqn. (1.4)).
Ampère’s approach has been developed into a fairly complete electrodynamics
by Moon & Spencer (1954), who abandon the theory of special relativity and
interpret the force between moving charges in terms of equality of action and
reaction, and return to the Galilean transformations. The best known of the
theories suggested as alternatives to special relativity is that of Ritz. An excellent
account of this theory is given by O’Rahilly (1938). Future developments in
quantum electrodynamics or some other future theory may possibly necessitate
a return to such a theory, but the theory of special relativity has a large measure
of support at present and the Biot-Savart law in differential form is consistent
with it. Furthermore, at present, classical electromagnetism and the theory of
special relativity are normally taken as the starting points for quantum
electrodynamics. 

As it is not necessary to discuss the magnetic field of a moving charge at 
school, it is undoubtedly wiser to leave the Biot-Savart law in integral form at 
all times at school. In this form it can be tested experimentally using current 
balances where action and reaction are equal and opposite, and the students do 
not have to face the rigours of the concepts of special relativity at too tender an 
age. Even at University level nothing is gained by putting the Biot-Savart 
law in differential form until after Maxwell’s equations are developed, as it is 
possible to go directly from the integral Biot-Savart law to Ampère’s circuital
theorem and vice versa. 


ACKNOWLEDGMENTS 


The author would like to thank Professor G. K. T. Conn, Professor E. G. 
Cullwick, Mr. R. Jakeways, Mr. N. Portman and Dr. R. A. R. Tricker for 
valuable comments and advice. 


REFERENCES 


Abraham, M. and Becker, R., 1932, Classical Theory of Electricity and Magnetism, p. 244;
Blackie.

Ampère, A.-M., 1825, Mem. de l’Acad., 6, 175.

Grassmann, H., 1845, Ann. d. Phys., 66, 1.

Heaviside, O., 1888, Electrician, p. 229.

Maxwell, J. C., 1881, Electricity and Magnetism, 2nd Edition, Vol. II (Oxford: Clarendon
Press).

Moon, P. and Spencer, D. E., 1954, Jour. Franklin Inst., 257, 203, 305 and 369.

O’Rahilly, A., 1938, Electromagnetics (London: Longmans).

Page, L. and Adams, N. I., 1945, Amer. J. of Phys., 13, 141.

Panofsky, W. K. H. and Phillips, M., 1955, Classical Electricity and Magnetism
(Addison-Wesley).

Robertson, I. A., 1945, Phil. Mag., 36, 32.

Rosser, W. G. V., 1959, Contemporary Physics, 1, 134; 1960, Ibid., 1, 453.

Smythe, W. R., 1939, Static and Dynamic Electricity, p. 494 (New York: McGraw-Hill).

Tricker, R. A. R., 1955, On the Teaching of Electricity in Schools (London: John Murray).

Whittaker, E., 1951, A History of the Theories of Aether and Electricity (London: Thomas
Nelson & Sons Ltd.).


Cerenkov Radiation: its Origin, Properties and Applications 


by J. V. JELLEY 
Atomic Energy Research Establishment, Harwell, Berkshire 


SUMMARY 


In this account of Cerenkov radiation, the optical analogue of the
‘supersonic bang’, great emphasis is laid on the underlying physical
principles of the effect. The historical introduction is followed by a qualita- 
tive account of the basic mechanism of the generation of light by the passage 
of fast charged particles through homogeneous transparent media. After a 
discussion of the relationship of Cerenkov radiation to other processes 
connected with the interaction of fast particles with matter, the article turns 
to the practical applications of Cerenkov detectors and counters in the fields 
of cosmic-ray and high-energy physics. In conclusion, the author describes 
some interesting experiments carried out by his colleagues and himself on 
light flashes from the night sky. 


1. INTRODUCTION 


The subject of Cerenkov radiation occupies but a very small niche in modern 
physics, although its applications are becoming ever more numerous in 
specialized fields, in particular in the new and exciting studies of the properties 
of particles of very high energy. 

While this rather unusual radiation process has been known to the majority 
of physicists for many years, the appreciation of its significance was heightened 
in 1958 when the Nobel Prize in physics was awarded jointly to three Russian 
scientists for their extensive studies of this radiation, studies which have con- 
tinued over a period of many years, from 1934 to the present time. The three 
physicists concerned, all members of the Academy of Sciences of the U.S.S.R., 
are Cerenkov, an experimentalist who carried out the basic investigations from 
1934 to 1938, and two theoreticians, Frank and Tamm, who in 1937 were the 
first to present the correct interpretation of Cerenkov’s experiments. Closely 
associated with these names is that of Vavilov, the professor under whom 
Cerenkov was working at the time. 

To this historical note it is only fair to add that the basic phenomenon
had already been noticed, and was under study, by a Frenchman of the name of
Mallet (1929), whose work is almost unknown or frequently ignored. Mallet’s
work was not as extensive as Cerenkov’s, and he offered no
theoretical interpretation of his observations.

What were these early observations ? Looking back even further, to the 
turn of this century, we find that many early workers in the then new and 
revolutionary field of radioactivity had noticed that transparent media such as 
glass, water and certain crystalline substances emitted a very feeble bluish-white 
glow when placed close to strong radioactive sources ; Madame Curie was one 
who noticed this effect, for she mentions in her records that the glass bottles 
containing her strong concentrates of radium salts were found aglow in the dark. 




These observations are mentioned to emphasize the lapse of time between the 
earliest awareness of the existence of the phenomenon, and its detailed study and 
interpretation. 

Two factors contributed to the delay in the study of this feeble light from 
substances near strong radioactive sources. First, the discoveries associated 
with the radioactivity itself were far more significant, and all pervading, to those 
engaged in the work at that time. Secondly, it was assumed, and taken for 
granted uncritically, that these faint visible radiations were merely due to some 
fluorescence phenomenon. It was not until Mallet, and later Cerenkov, began
to investigate the phenomenon in some detail that it was discovered
that this was indeed a new mechanism of light generation, not in any
way connected with fluorescence.


2. DESCRIPTIVE EXPLANATION 
Cerenkov radiation is an electromagnetic shock-wave phenomenon, the 
direct optical analogue of the supersonic bang, or the bow wave from a swiftly 
moving vessel on water. 


Fig. 1. To illustrate the formation of the bow wave from a ship: (a) vessel stationary;
(b) vessel moving slowly; (c) the critical velocity; (d) vessel moving faster than
the velocity of surface waves on the water.


Let us for a moment look at these two more familiar processes first. If we
stand at the bow of a ship which is at anchor in still water, and lower a weighted
fishing line into the water, we will find that if we rapidly agitate the weight up
and down through the surface of the water, a succession of circular waves will
radiate out from the point where the agitation is taking place, as shown in
fig. 1a. Now suppose the ship lifts anchor and commences to move slowly
forward, while we continue to manipulate the fishing line as before. Circular
wavelets will still be set up and will radiate as before, but now the individual
circles will no longer be concentric, so that the pattern will appear as shown in
fig. 1b, with the circles more closely spaced in front than behind. As the ship
gets under way a critical velocity is reached, fig. 1c, for which all the circles
touch each other; there is now a common wavefront along a tangent to the
circles, this wavefront being perpendicular to the direction of motion of the
vessel. This condition occurs when the speed of the boat is precisely the same
as the velocity of the surface waves on the water.

If now the ship moves faster still, that is, if its speed exceeds the wave velocity, 
the points of origin of freshly produced circles will overtake the wavefronts of 
circles already generated. In this case the wave pattern will appear as in fig. 1d; 
the wavefront common to all the circles now consists of two straight lines tan- 
gential to the circles, forming a Vee, as in the familiar bow wave. 

In this example we have of course considered the formation of waves only 
in one plane, i.e. radiation in only two dimensions. A process rather more 
akin to Cerenkov radiation is that which occurs in acoustics when a missile or 
fast aircraft moves through the air at a speed in excess of the velocity of sound in 
air; this leads also to a V-shaped shock-wave, but in this case, since we are 
considering a three-dimensional medium, the wavefront lies on a conical surface, 
the axis of which coincides with the path of the missile and the apex of which is 
located at the tip of the missile ; in the case of an aircraft, the shock-wave front 
gives rise to the now familiar ‘ supersonic bang’ which occurs when the conical 
surface passes over the observer. 


3. SHOCK-WAVES IN OPTICS 


Having discussed at some length how hydrodynamic and acoustic shock- 
waves are produced, we are in a position to consider Cerenkov radiation, which 
is the optical or electromagnetic equivalent of the shock-wave phenomenon. 
For this process to occur it is necessary for a charged particle, e.g. an electron
or proton, to travel through a transparent medium at a velocity in excess of the
velocity of electromagnetic waves in the medium. The particle takes the rôle
of the missile or ship, the medium the rôle of the air or water, in analogy with
the two examples above, and the velocity with which we are now concerned is the
velocity of light. But, the reader may object, nothing can ever move faster than
light. This limitation, imposed by relativistic considerations, only applies,
however, in a vacuum; it is quite feasible for a particle to move at a velocity
just below that of light in vacuo, usually denoted by c, while, in the medium,
the wave velocity will be much smaller, owing to the refractive index of the
medium. If the refractive index of the medium is n, the velocity of light
therein is (c/n), and if the velocity of the particle is v, the essential criterion which
must be satisfied for a shock-wave to be produced is that v must exceed (c/n).

The majority of transparent solid and liquid media have refractive indices n
around 1.5 (a few very heavy glasses go up to n = 2) so that, with c = 3 × 10¹⁰ cm/sec,
we find that the minimum particle velocities are exceedingly high, in the
region of 2 × 10¹⁰ cm/sec; single particles of this speed, if they are electrons,
are emitted from certain radioactive substances, and particles of many types,
with sufficient velocity, can also be found naturally, in the cosmic radiation;
they may also be produced artificially in many of the various types of particle
accelerating machines.
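
The criterion v > c/n can be checked numerically. The following Python sketch
is offered only as an illustration: the function name is hypothetical, and the
list of refractive indices simply repeats the round figures quoted above
(water's value of 1.333 is the one tabulated later in the article).

```python
# Minimum ("threshold") particle velocity for shock-wave formation: v_min = c / n.
C_CM_PER_SEC = 3.0e10  # velocity of light in vacuo, cm/sec (round figure from the text)


def threshold_velocity(n, c=C_CM_PER_SEC):
    """Return the velocity of light in a medium of refractive index n, in cm/sec."""
    if n < 1.0:
        raise ValueError("no threshold exists for n < 1: c/n would exceed c")
    return c / n


if __name__ == "__main__":
    for label, n in [("water", 1.333), ("typical glass", 1.5), ("heavy glass", 2.0)]:
        print(f"{label:14s} n = {n:<6} v_min = {threshold_velocity(n):.2e} cm/sec")
```

For n = 1.5 this gives the 2 × 10¹⁰ cm/sec quoted in the text.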




4. FUNDAMENTAL PRINCIPLES


We will suppose an electron of charge e is traversing a slab of glass at a
velocity v, as shown in fig. 2, in which the undisturbed atoms are represented
by circles. The scale is such that the average spacing of the atoms from their
neighbours is ~5 × 10⁻⁸ cm. The region of glass in the immediate neighbourhood
of the particle will be polarized, in much the same way as the whole slab
of glass would be polarized if it were placed between two charged metal plates
as in a condenser. This electrical polarization of atoms, viewed on a classical
basis, arises from the distortion of the electron orbits, so that each atom behaves
as a tiny dipole in which the mean electrical centre of negative charge of the
circulating electrons becomes slightly displaced from the position of the positive
charge of the nucleus of the atom. Since our moving particle is in effect a point
charge, the polarization of the medium is very localized, whereas polarization
extends throughout the dielectric in a parallel plate condenser.

When the charge is moving very slowly the polarization at any instant of
time is as shown in fig. 2a. The polarization is completely symmetrical about
the instantaneous position of the charge, and as the charge moves (slowly)
through the glass, the region of polarization moves along with the charge while
of course the atoms themselves remain stationary. Under these conditions of
spherically symmetrical polarization, the elementary dipoles cannot radiate,
because at any considerable distance from the disturbing particle there will be
no resultant dipole field. It should be noted that if we are looking for radiation
in the visible region of the spectrum, the wavelength of such radiation is
considerably greater than the diameters of the atoms, so that for each polarized
atom there is another somewhere else for which the polarization is opposite;
thus it is that at a point many wavelengths away from the disturbance there is no
resultant field. (For blue light, for example, for which the wavelength is
4 × 10⁻⁵ cm, there are about 1000 atoms to one wavelength, or about 10⁹ in a
cubic wavelength, so that the statistical fluctuations are small and the
cancellation of individual dipole fields is complete.)

Fig. 2. The polarization set up in a dielectric by the passage of a charged particle:
(a) at low velocity; (b) at high velocity.

When, however, our electron is moving very fast, or, to be more precise, at a
speed comparable with the velocity of light in the glass, the picture is quite
different, as shown in fig. 2b. In this case, owing to the time required for the
effects of the disturbance to spread through the medium (i.e. the effects of
‘retarded potentials’), the polarization is no longer distributed symmetrically.
There is still axial symmetry, but there is no longer symmetry along the axis,
with the result that even at distant points there is a resultant dipole field, and the
region of glass in the immediate neighbourhood of the moving charge will
behave as an elementary dipole, though only for a very brief instant of time while
the charge moves through the region in question. Each elementary and
transitory dipole will radiate in much the same way as a tuned half-wave radio aerial,
the emitted energy having an intensity distribution which is uniform in azimuth
around the axis, and which varies as sin² θ where θ is the angle between the axis
and a particular direction of observation.

So much for the fundamental process, in which we have considered only a
single element of path. In practice of course the glass slab has considerable
thickness and the elementary dipoles along the track will each radiate
independently and in rapid succession. There will, however, be interference between
the wavelets radiated by all these dipoles, and it is this phenomenon of
interference which leads to one of the distinguishing features of Cerenkov radiation.
It is easy to show, from the simple Huygens construction of fig. 3a, that there is
only one angle of emission of the radiation for which the wavelets can
combine in phase to produce a substantial resultant effect. When the radiation
from the separate elements of track combines in phase, the radiation is coherent.

Fig. 3. (a) Huygens construction to illustrate coherence; (b) the Cerenkov cone and the
polarization vectors.

Taking the velocity of light in vacuo c as a constant, it is convenient to
express the particle velocity v in terms of c; we write v = βc. Suppose now
we consider the particle to travel at this constant velocity βc between two points
A and B in our slab of glass, in a short time Δt, and suppose that in this same interval
of time light generated at A travels along a line from A to C, inclined at some
at present arbitrary angle θ to the line AB. Since the velocity of the light in the
glass is (c/n) we at once can write that AB = βc Δt and AC = (c/n) Δt. The
wavelets from successive points of the track reinforce one another only along a
common wavefront BC perpendicular to AC, from which we obtain the
fundamental expression, known as the Cerenkov Relation, namely that

$$\cos\theta = \frac{AC}{AB} = \frac{1}{\beta n} \qquad (1)$$

which implies that the radiation will be emitted in only one direction AC, and
that this direction depends solely on the velocity of the particle and the refractive
index of the medium; the wave-front of the radiation coincides with the line
BC. Since there is axial symmetry, the radiation will occur not along a single
line AC but over the whole surface of a cone, as shown in fig. 3b. For those
familiar with Maxwell’s electromagnetic theory of light, it is worth noting that
the magnetic and electric vectors are orthogonal to each other and to the direction
of propagation; i.e. the electric vector E is everywhere perpendicular to the
surface of the cone and the magnetic vector H is tangential to the surface.

If we look at eqn. (1) we find that there are two special conditions which lead
to characteristic features of this type of radiation. First, there is a lower limit
or ‘threshold’ particle velocity below which no radiation can take place; as
this velocity is approached from above the cone angle collapses to zero. This
threshold condition occurs when θ = 0, for which we obtain at once that
β_min = (1/n). The second result is that there is a maximum value of the
Cerenkov angle θ for extremely high velocities v. Since v = βc, when v → c (the
limiting velocity for a particle) β → 1 and eqn. (1) leads to the result that
θ (max) = cos⁻¹ (1/n).
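
The Cerenkov Relation and its two limiting cases lend themselves to a brief
numerical sketch. The Python below is an illustration only; the function names
are hypothetical, and the refractive indices used in the demonstration are
those tabulated later in the article for water and air.

```python
import math


def cerenkov_angle_deg(beta, n):
    """Angle theta (degrees) from cos(theta) = 1 / (beta * n); None below threshold."""
    if not 0.0 < beta <= 1.0:
        raise ValueError("beta = v/c must lie in (0, 1]")
    cos_theta = 1.0 / (beta * n)
    if cos_theta > 1.0:
        return None  # particle slower than light in the medium: no radiation
    return math.degrees(math.acos(cos_theta))


def threshold_beta(n):
    """Threshold velocity in units of c, beta_min = 1 / n."""
    return 1.0 / n


def max_angle_deg(n):
    """Maximum cone angle, reached as beta -> 1: theta_max = arccos(1 / n)."""
    return math.degrees(math.acos(1.0 / n))


if __name__ == "__main__":
    for label, n in [("water", 1.333), ("air", 1.00029)]:
        print(f"{label}: beta_min = {threshold_beta(n):.5f}, "
              f"theta_max = {max_angle_deg(n):.2f} deg")
    print("beta = 0.7 in water:", cerenkov_angle_deg(0.7, 1.333))
```

Returning None below threshold, rather than raising an error, is merely a
design choice that makes it easy to scan over a range of velocities.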

We will now wish to know just how much light is produced and what its
spectral distribution will be. The classical theory of Frank and Tamm leads
to the result that the output of light W, for a path length l, is given by:

$$W = \frac{4\pi^2 e^2 l}{c^2}\int_{\beta n>1}\left(1 - \frac{1}{\beta^2 n^2}\right)\nu\,d\nu \qquad (2)$$

where ν is the frequency of the light wave (= c/λ where λ is the corresponding
wavelength in vacuo) and dν is the band-width, expressed also as a frequency.
The factor (1 − 1/β²n²) appears within the integral because the refractive index n
is itself a function of the frequency, n(ν), due to dispersion. The integration is
carried out over the range of ν for which the threshold criterion βn > 1 is
satisfied; in fact if n were not a function of frequency the energy output would
be infinite! In practice all optically transparent media have absorption
resonances in the ultra-violet (and some of them similar resonances in the
infra-red); in these spectral regions n falls below unity and radiation in these regions
cannot occur. The radiation is thus emitted as a continuous spectrum, the
majority of the energy appearing in the visible region, some in the infra-red and
a little in the microwave and radio-frequency bands.

Over a limited wave-band we can consider n constant, in which case, since
cos θ = 1/βn, the factor (1 − 1/β²n²) is equivalent to sin² θ and may be taken outside
the integral. It is significant that this factor sin² θ is just that for the angular
distribution of radiation from a single element of the track, as discussed earlier.

The spectral distribution function can now be derived directly from (2) and
may be written either in terms of frequency ν, or in terms of the wavelength λ:

$$\frac{dW}{d\nu} \propto \nu, \qquad \frac{dW}{d\lambda} \propto \frac{1}{\lambda^3} \qquad (3)$$

These expressions reveal at once that the radiation is concentrated towards the
short-wave end of the visible spectrum; the glow around the core of a high
power water-moderated or ‘swimming pool’ reactor, which is due to Cerenkov
radiation, has a very characteristic bluish-white colour.
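
The strength of this bias towards short wavelengths can be illustrated with a
few lines of Python. The sketch assumes, as in the preceding paragraph, that n
is constant over the band, so that the energy per unit wavelength interval
varies as 1/λ³; the function name and the two wavelengths chosen (4000 Å for
blue, 7000 Å for red) are illustrative assumptions.

```python
def relative_intensity_per_unit_wavelength(wavelength, reference_wavelength):
    """Ratio of dW/d(lambda) at `wavelength` to that at `reference_wavelength`.

    Assumes dW/d(lambda) is proportional to 1 / lambda**3 (n taken as constant).
    Both wavelengths must be expressed in the same units.
    """
    if wavelength <= 0.0 or reference_wavelength <= 0.0:
        raise ValueError("wavelengths must be positive")
    return (reference_wavelength / wavelength) ** 3


if __name__ == "__main__":
    blue, red = 4000.0, 7000.0  # angstroms, ASSUMED representative values
    ratio = relative_intensity_per_unit_wavelength(blue, red)
    print(f"blue/red energy per unit wavelength: {ratio:.2f}")
```

On these assumptions the blue end of the visible spectrum carries roughly five
times the energy per unit wavelength of the red end.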




If c.g.s. units are used, the radiation yield W in (2) will be in ergs. It is
frequently more convenient to express W in terms of the number of photons
produced per unit path length, a photon being the smallest indivisible ‘packet’
of energy in a light-wave, a concept that derives directly from the quantum
theory. A single photon of frequency ν has an energy hν, where h is Planck’s
constant (6.62 × 10⁻²⁷ erg sec) so that an energy W may be expressed in terms
of the number of photons N, by: W = N.hν. In this form, (2) may be reduced
to a practical formula for calculating the photon yield per unit path:

$$N = \frac{4\pi^2 e^2}{hc}\left(\frac{1}{\lambda_2} - \frac{1}{\lambda_1}\right)\sin^2\theta \qquad (4)$$

over a band extending between wavelength limits λ₁ and λ₂ (with λ₂ < λ₁).
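
As a rough check on eqn. (4), the Python sketch below evaluates the photon
yield per centimetre in c.g.s. units. The value used for the electronic charge
(4.803 × 10⁻¹⁰ e.s.u.) is an assumption supplied here, since the text does not
quote it; h and c are the round figures given in the text, and the function
name is hypothetical.

```python
import math

E_CHARGE_ESU = 4.803e-10   # electronic charge, e.s.u. (ASSUMED; not quoted in the text)
H_PLANCK_ERG_SEC = 6.62e-27  # Planck's constant, erg sec (value quoted in the text)
C_CM_PER_SEC = 3.0e10        # velocity of light in vacuo, cm/sec


def photons_per_cm(beta, n, lambda_short_cm, lambda_long_cm):
    """Photon yield per cm of path from eqn. (4), with n taken as constant.

    Returns 0.0 when the particle is at or below threshold (beta * n <= 1).
    """
    sin_sq_theta = 1.0 - 1.0 / (beta * n) ** 2
    if sin_sq_theta <= 0.0:
        return 0.0
    prefactor = 4.0 * math.pi ** 2 * E_CHARGE_ESU ** 2 / (H_PLANCK_ERG_SEC * C_CM_PER_SEC)
    return prefactor * (1.0 / lambda_short_cm - 1.0 / lambda_long_cm) * sin_sq_theta


if __name__ == "__main__":
    band = (3500e-8, 5500e-8)  # 3500 to 5500 angstroms, expressed in cm
    for label, n in [("water", 1.333), ("air", 1.00029)]:
        print(f"{label}: {photons_per_cm(1.0, n, *band):.3g} photons/cm at beta = 1")
```

For water with β = 1 the result comes out close to 200 photons/cm over this
band; for air it is a fraction of a photon per centimetre.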


5. SOME NUMERICAL EXAMPLES 


Suppose that we have three types of singly charged particle, electrons,
μ-mesons (a constituent of cosmic radiation) and protons, and we wish to work out
the characteristics of the Cerenkov radiation produced when these particles pass
through 1 cm of water or 1 cm of air. Since the radiation characteristics depend
on the velocities of the particles, it is necessary first to derive the kinetic energies
which correspond to these velocities, for each of these particles. For this we
use the relativistic expression for kinetic energy, namely

$$E = m_0 c^2\left(\frac{1}{\sqrt{1-\beta^2}} - 1\right) \qquad (5)$$

where m₀ is the rest mass of the particle. It is convenient, furthermore, to
express rest energies m₀c² and kinetic energies E in terms of a unit called the
electron-volt, or ev, where 1 ev = 1.6 × 10⁻¹² ergs. The electron-volt is the
amount of kinetic energy an electron acquires in falling through a potential
difference of one volt.

With the basic knowledge that m₀c² for an electron is 0.511 × 10⁶ ev, and that
the masses of a μ-meson and proton are 209 and 1840 times the mass of
an electron respectively, we can use eqns. (1), (4) and (5) to obtain the results
we require. These results are tabulated below; water is typical, as far as
refractive index is concerned, of most liquid and solid media, while air is a very
tenuous medium in its dielectric or refractive properties.


Feature                                          Water             Air
Refractive index                                 1.333             1.00029
θ (max)                                          41°               1.4°
N (max), i.e. when β = 1, for λ 3500–5500 Å      210 photons/cm    0.25 photons/cm
Threshold energies:
  electrons   (mass mₑ)                          260 kev*          21 Mev
  μ-mesons    (mass 209 mₑ)                      52 Mev†           4.3 Gev‡
  protons     (mass 1840 mₑ)                     480 Mev           38 Gev

* 1 kev, a kilo-electron volt. † 1 Mev, a million electron volts. ‡ 1 Gev, a giga electron volt = 10⁹ ev.
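
The threshold energies follow from eqn. (5) with β set equal to 1/n. The Python
sketch below is an illustrative calculation using only the rest energy and mass
ratios quoted in the text; the function and constant names are hypothetical,
and the computed figures should be expected to agree with the tabulated ones
only to within a few per cent, the table being given in round numbers.

```python
import math

ELECTRON_REST_ENERGY_MEV = 0.511  # m0 c^2 for the electron, as quoted in the text
MASS_RATIOS = {"electron": 1.0, "mu-meson": 209.0, "proton": 1840.0}  # from the text


def threshold_kinetic_energy_mev(n, mass_ratio):
    """Kinetic energy (Mev) at the Cerenkov threshold beta = 1 / n, from eqn. (5)."""
    if n <= 1.0:
        raise ValueError("a threshold exists only for n > 1")
    beta = 1.0 / n
    gamma = 1.0 / math.sqrt(1.0 - beta ** 2)
    return mass_ratio * ELECTRON_REST_ENERGY_MEV * (gamma - 1.0)


if __name__ == "__main__":
    for medium, n in [("water", 1.333), ("air", 1.00029)]:
        for particle, ratio in MASS_RATIOS.items():
            energy = threshold_kinetic_energy_mev(n, ratio)
            print(f"{medium:6s} {particle:9s} threshold = {energy:.3g} Mev")
```

Because the threshold velocity depends only on n, the threshold energy for a
given medium scales directly with the rest mass of the particle.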




From these results we see that for a given type of particle, the higher the
refractive index the lower the threshold energy and the larger the cone angle
and the light production. We also see that for a given refractive index, for which
the threshold velocity is the same for all particles, the threshold energy
increases with particle mass.
It is these features of Cerenkov radiation which have led to the development of 
Cerenkov detectors as extremely useful tools in nuclear and cosmic-ray physics ; 
some of these applications are discussed briefly later. Since, however, the pri- 
mary aim of this article is to explain the fundamentals of the subject we will now 
turn to a number of points which require further explanation. 


6. IONIZATION ALONG THE TRACK 


As the particle flies along through the medium, it will collide with atoms on 
the way and produce considerable ionization. In terms of energy loss, the 
ionization process indeed dominates over the Cerenkov effect. For a relativistic 
particle, the energy-loss due to ionization amounts to about 2 Mev/cm of path 
in water, this being ~2000 times that lost by Cerenkov radiation. In most
fairly pure substances, however, very little of this ionization energy appears as 
light, so that the Cerenkov effect is dominant as far as emission in the optical 
region of the spectrum is concerned. There are of course substances which 
produce quite large light yields from the ionization process; such materials, 
called ‘ scintillators ’, have themselves proved to be also very useful for particle 
detection, and the scintillation counter is indeed an extremely versatile tool in 
the physics of high-energy particles. 


Any light which is radiated in association with the ionization process will 
appear at a time, after the original ionizing event, determined by the particular 
life-times of the atomic energy states responsible for the radiation. For this 
light there is therefore no coherence between wavelets from different portions 
of the track, and thus no velocity threshold or characteristic cone; such light 
therefore will be emitted isotropically. 

One frequently encounters the misconception that Cerenkov radiation is in
some way associated with ionization, and the above discussion has been
deliberately included to clarify the differences between the two processes. This
misconception can moreover be easily dispelled by considering the energy
exchange taking place between the particle and a single atom involved
in the Cerenkov process. Suppose the particle traverses a slab of dielectric
material of thickness $x$ (see fig. 4). As a result of the polarization, the atoms will
be disturbed out to a distance $\sim\lambda$, the wavelength of the emitted radiation.
The volume of material affected will therefore be $\pi\lambda^2 x$ and the number of
atoms involved will be $\pi\lambda^2 x\rho$, where $\rho$ is the number of atoms per unit volume.
Taking the number of photons produced in this slab to be $Nx$, where $N$ is
obtained from (4), we find this corresponds to an energy $W = Nx\,h\nu$, where $h\nu$ is
the energy of a single photon. The energy exchange between the moving
particle and an individual atom is thus
$\Delta W = Nx\,h\nu/(\pi\lambda^2 x\rho) = N h\nu/(\pi\lambda^2\rho)$.
Working only in round figures, for blue light, we have $h\nu \approx 2$ ev,
$\lambda = 4\times10^{-5}$ cm, $N = 200$ photons/cm, and $\rho \approx 10^{23}$ atoms/cm$^3$, from which
we calculate $\Delta W \approx 8\times10^{-13}$ ev. This minute energy exchange is far below the level
required to excite the atoms to any of the quantum levels above their ground
state, so that no excitation or ionization is involved in the process. Moreover,
referring to our simple picture of the atomic polarization in fig. 1b, the
energy exchange is so small that the deviation from atomic sphericity would
be quite imperceptible in this sketch.
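
The round-figure estimate above is easy to check numerically. The short sketch below only re-evaluates $\Delta W = N h\nu/(\pi\lambda^2\rho)$ with the values quoted in the text; the function and variable names are illustrative and are not taken from the article.

```python
import math

def energy_per_atom_ev(photons_per_cm, photon_energy_ev, wavelength_cm, atoms_per_cm3):
    """Energy exchanged with one atom, dW = N*h*nu / (pi * lambda^2 * rho), in ev."""
    # Number of atoms inside a cylinder of radius lambda per cm of track.
    atoms_per_cm_of_track = math.pi * wavelength_cm**2 * atoms_per_cm3
    return photons_per_cm * photon_energy_ev / atoms_per_cm_of_track

# Round figures quoted in the text for blue light.
delta_w = energy_per_atom_ev(photons_per_cm=200, photon_energy_ev=2.0,
                             wavelength_cm=4e-5, atoms_per_cm3=1e23)
print(f"energy exchange per atom ~ {delta_w:.1e} ev")
```

The slab thickness $x$ cancels, which is why it does not appear as an argument.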


Fig. 4. The effective number of atoms involved in the energy exchange between the
particle and the medium.


7. OTHER RADIATION PROCESSES 

There is really very little similarity between Cerenkov radiation and other
radiation processes, for example the production of x-rays when electrons
impinge on metal targets, or the production of γ-rays by the process of
Bremsstrahlung†, which occurs when a charged particle passes close to an atomic
nucleus and is deflected by the strong Coulomb field.

Both these processes, and a third known as synchrotron radiation, which is 
produced when a swiftly-moving particle is deflected in a magnetic field, are all 
due to the charge undergoing an acceleration or deceleration. In Cerenkov 
radiation the particle is assumed to be moving at a constant velocity ; in fact 
this is imperative if the radiation is to be coherent. 

A second difference is that in the production of x-rays and γ-rays (by
Bremsstrahlung), most of the radiation is emitted with a quantum energy
comparable with the kinetic energy of the moving charge, and there is no 
coherence. In other words, in these two processes, the energy loss of the 
particle occurs from relatively widely spaced collisions in each of which a large 
energy transfer occurs, whereas in Cerenkov radiation a vast sea of atoms is 
involved, the ‘ collisions ’ are ‘ remote ’ and the energy loss is gentle and smooth. 

A third and fundamental difference is that in these other radiation processes,
it is the particle itself which causes the radiation; if the particle is moving at
relativistic speeds the radiation will carry away much of the forward momentum
of the ‘collision’, so that the radiation will be bunched into a relatively narrow
beam. The solid angle over which most of the radiation takes place under
these conditions is $\sim(mc^2/E)$, where $mc^2$ is the rest energy of the particle and $E$
is its kinetic energy. In other words, the higher the energy the narrower the beam.

† From the German ‘bremsen’, meaning to brake, and ‘Strahlung’, meaning literally
streaming, or radiation. The term thus implies radiation by the process of the slowing
down of charged particles.

In Cerenkov radiation, the reverse situation arises; the higher the energy 
the wider the cone angle. The basic reason for this is that in the Cerenkov 
effect, although the particle may be moving at a relativistic speed, and the energy 
is indeed supplied by the particle, the radiation itself comes from the polarized 
atoms in the medium (fig. 1b), which atoms are fixed in the medium and do not 
take part in any bodily motion. This is why eqn. (4) contains the factor
$\sin^2\theta$, the radiation pattern of a stationary but oscillating dipole, but does not
contain a factor like $(mc^2/E)$ which appears in the Bremsstrahlung radiation
equations.


8. RECOIL EFFECTS 


In the above description of the Cerenkov effect, we have assumed the particle 
to be moving in a straight line at a constant velocity. In practice, the other 
effects which have already been mentioned are however taking place simul- 
taneously, and each of them (ionization, Bremsstrahlung and Coulomb scatter- 
ing) will modify the direction and velocity of the particle to some extent. Never- 
theless, even in their absence, there is a further effect which is directly connected 
with the radiation mechanism, and that is the effect the radiation field itself will 
have on the particle. In classical theory this is known as radiation reaction, and 
in the quantum theory it is equivalent to a recoil. Ginsburg (1940) was the 
first to produce a quantum treatment of Cerenkov radiation, and his results show 
that both the coherence condition (1), and the light yield (2), have additional 
small correction terms which arise from this recoil effect. The effects, however, 
are extremely minute and his conclusions cannot be tested experimentally. It
is not surprising that the effects are small, because the momentum of the emitted
photons ($h\nu/c$) is negligible compared with the momentum of the particle; in a
typical case the particle might have a kinetic energy of say 100 Mev (i.e., $10^8$ ev)
while the emitted photons have energies of only a few ev.
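
To put a rough number on this comparison, the sketch below evaluates the ratio of photon momentum to particle momentum for the typical case just quoted. The text does not say which particle is meant, so both an electron and a proton are tried; their rest energies and the photon energy of 3 ev are assumptions made here for illustration only.

```python
import math

def momentum_times_c_ev(kinetic_energy_ev, rest_energy_ev):
    """Relativistic momentum times c, pc = sqrt(T^2 + 2*T*mc^2), in ev."""
    return math.sqrt(kinetic_energy_ev**2 + 2.0 * kinetic_energy_ev * rest_energy_ev)

KINETIC_ENERGY_EV = 1.0e8   # 100 Mev, the typical case quoted in the text
PHOTON_ENERGY_EV = 3.0      # assumed value for "a few ev"; for a photon pc = h*nu
REST_ENERGY_EV = {"electron": 0.511e6, "proton": 938.3e6}  # assumed particle types

ratios = {}
for name, rest_energy in REST_ENERGY_EV.items():
    particle_pc = momentum_times_c_ev(KINETIC_ENERGY_EV, rest_energy)
    ratios[name] = PHOTON_ENERGY_EV / particle_pc
    print(f"{name}: photon momentum / particle momentum ~ {ratios[name]:.1e}")
```

In both assumed cases the ratio is far below one part in a million, consistent with the statement that the recoil corrections are extremely small.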


9. APPLICATIONS OF THE CERENKOV EFFECT


Since the light output is so small from single particles, and it is single 
particles that usually are of greatest interest to the nuclear physicist, the develop- 
ment of successful particle counters based on the Cerenkov effect had to await 
the advent of the photomultiplier, the most sensitive light detector known. 

Following the author’s first experiment with counting single cosmic-ray 
particles by the Cerenkov radiation produced in distilled water (1951), Cerenkov 
detectors of various forms began to appear rapidly, particularly in work carried 
out in laboratories in the U.S.A., by Mather in Berkeley (1951) and Marshall 
in Chicago (1952, 1954), see also Jelley (1958). 

Cerenkov detectors may be used in a variety of ways, as will be seen from 
eqns. (1) and (2). For instance, over a limited energy region, it is possible to 
measure the velocities of charged particles, to some considerable precision, by 
measuring the cone-angle for the emitted light. Fig. 5 shows an instrument using
photographic recording, which was used for determining the energies
of the protons emerging from the synchrocyclotron at Berkeley, California.
The proton beam passes through a thin glass ‘radiator’ in which the Cerenkov
radiation is produced, and a fraction of the light-cone is reflected from the rear
of the glass and is focused on a film in the camera; the prism is included to
compensate for the dispersion in the Cerenkov radiation.


[Figure 5. Labels: aluminized surface; light-tight box; 35 mm film camera; scale 1 inch.]

Fig. 5. Mather’s photographic instrument used for accurate measurements of proton
velocities.


Cerenkov detectors may be used, in conjunction with other types of particle 
counter, to select particles within a pre-set and limited range of velocities. 
Such a detector was used by Chamberlain and others (1955) in the historic 
discovery of the anti-proton. Anti-protons are particles having the same mass 
as a normal proton but which have a negative charge of the same magnitude as 
that of an electron. The negative or anti-protons were produced in the Bevatron
accelerator in Berkeley, U.S.A., by bombarding a copper target with normal
protons having an energy of ~6 Gev. In this velocity selector, fig. 6, the
cone of Cerenkov radiation emitted by the particles travelling down the axis of a
plastic cylinder emerges after refraction from the end of the cylinder and, after
reflexion from a cylindrical mirror, is focused to a point further along the optic
axis of the system. The Cerenkov radiation is detected by one or more
photomultipliers. To avoid the effects of the particles themselves passing through
the photomultiplier, and to improve the discrimination, the emergent light was
divided by subsidiary mirrors and collected by three phototubes operating in
an electronic coincidence system. The velocity is selected by varying the
positions of the plastic radiator and the phototubes from the plane of the cylindrical
mirror (to vary the angle at which the light may be intercepted), and a central
baffle serves further to restrict the angles.

One of the most important applications of Cerenkov detectors is their use as 
‘threshold’ detectors. Thus, for a given type of particle, it is possible to 
record just those particles which have a velocity above the threshold for the 
Cerenkov radiation in the particular material used, and to reject particles having 
a velocity below the threshold. In nuclear physics techniques, it is often useful 
to operate such a counter in time-coincidence with other detectors mounted in 
the particle beam. It is also possible, by connecting such a Cerenkov detector
in anti-coincidence with other counters, to deliberately note events which are
due to particles having a velocity below the threshold, by recording signals from
the other detectors which were found to occur when no signal appeared in the
Cerenkov detector.

Another and equally important application of the threshold characteristics 
of the Cerenkov counter is the discrimination between particles of different 
type or mass. We see, from (5), that the velocities of particles of different mass
are different, for the same energy. This is extremely useful in work on the
modern large accelerating machines, where it is often required, for example, to
count protons in the presence of large backgrounds of π-mesons and vice-versa.
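
The threshold discrimination just described can be illustrated with a short calculation. Eqns. (1) and (5) are not reproduced in this part of the article, so the sketch below assumes the standard relations: a particle radiates when $\beta n > 1$, the cone half-angle satisfies $\cos\theta = 1/(\beta n)$, and $\beta$ follows from the kinetic energy through $\gamma = 1 + T/mc^2$. The refractive index (water, 1.33), the kinetic energy (300 Mev) and the rest energies are illustrative assumptions, not figures from the text.

```python
import math

def beta_from_kinetic_energy(kinetic_energy_mev, rest_energy_mev):
    """Velocity in units of c for a particle of given kinetic and rest energy."""
    gamma = 1.0 + kinetic_energy_mev / rest_energy_mev
    return math.sqrt(1.0 - 1.0 / gamma**2)

def radiates(beta, refractive_index):
    """True if the particle is above the Cerenkov threshold, beta * n > 1."""
    return beta * refractive_index > 1.0

def cone_half_angle_deg(beta, refractive_index):
    """Cone half-angle from cos(theta) = 1 / (beta * n); valid only above threshold."""
    return math.degrees(math.acos(1.0 / (beta * refractive_index)))

N_WATER = 1.33                  # assumed radiator: water
KINETIC_ENERGY_MEV = 300.0      # assumed common kinetic energy
REST_ENERGY_MEV = {"proton": 938.3, "pi-meson": 139.6}  # assumed rest energies

betas = {name: beta_from_kinetic_energy(KINETIC_ENERGY_MEV, mc2)
         for name, mc2 in REST_ENERGY_MEV.items()}
for name, beta in betas.items():
    status = "radiates" if radiates(beta, N_WATER) else "below threshold"
    print(f"{name}: beta = {beta:.3f}, {status}")
```

Under these assumptions the π-meson is above threshold and the proton is not, so a water counter would respond to one and not the other at the same energy.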


[Figure 6. Labels: cylindrical mirror; 3-in. photomultiplier; Cerenkov radiator;
blackened baffle; particles; arrangement of the three photomultipliers; panels (a) and (b).]

Fig. 6. Chamberlain and Wiegand’s instrument, using three photomultipliers, developed
to select particles over a narrow range of velocities: (a) side view; (b) photomultiplier
system.


Cerenkov detectors have also been used extensively in cosmic-ray research ; 
two examples will be cited. Simple Cerenkov counters, carried up in high- 
altitude balloons, have been used as direction selectors to determine the 
cosmic-ray albedo of the earth’s atmosphere, that is the ratio of the number of 
particles travelling downwards from outer space, to the number coming upwards 
from the atmosphere below. A second and equally important application is 
that of determining the charge spectrum of the incoming primary radiation, also 
carried out in high-flying balloons. If $Ze$ is the charge on an incoming primary
particle, the intensity of the light flash is in fact not, as shown in eqn. (2), just
proportional to $e^2$, but to $(Ze)^2$, for eqn. (2) applies to particles of unit charge,
i.e. electrons and protons. Recordings then of the relative numbers of light
signals from the Cerenkov detector of different intensities lead to a knowledge
of the charge-spectrum, i.e. the relative numbers of helium, lithium, beryllium,
etc., nuclei present in the cosmic radiation. This, together with the cosmological
abundance of the elements, has a bearing on the still unsolved problem
of the origin of cosmic-rays.

For an up-to-date review of Cerenkov detectors the reader is referred to
the recent work of Hutchinson (1961).


10. CERENKOV RADIATION IN THE ATMOSPHERE 
In the Cerenkov detectors so far described, the radiation is produced in a 
small ‘ radiator’ situated in the instrument itself. On a much larger scale, the 
earth’s atmosphere can itself behave as a vast dielectric medium, in which 
Cerenkov radiation may be produced by cosmic-ray particles as they plunge 
down to the earth’s surface in a manner akin to a rain of charged arrows. In
general, however, the particles are too widely spaced, in extent and in time, for
the resultant light to be detected above the general background light of the
night-sky. However, in a cosmic-ray air shower, a great torrent of electrons
(of up to $10^7$ particles or more) sweeps down over a relatively small area of the
ground, the flood lasting for only the briefest period of time, ~$10^{-8}$ sec. In
these events, provided the recording system is very fast, the accompanying light flashes
are easy to detect with relatively simple equipment. Galbraith and the author
(1953) were the first to discover this phenomenon, for which they used the
simple equipment shown in fig. 7, consisting of a photomultiplier mounted at
the focus of a two-foot diameter parabolic mirror, the phototube being connected
to a fast electronic recording system, and in coincidence with a conventional
cosmic-ray air-shower array of Geiger counters. This equipment counted
air-showers at a rate of a few per minute on a clear dark moonless night. References
to subsequent work in this field may be found in a recent review paper, Jelley
(1959).

[Figure 7. Labels: light arriving from the limit of the field of view; mean light
intensity; f/0.5 60 cm dia. parabolic mirror (silvered on front or back); light screen;
air to prevent dew; lamp control; to EHT and dynode supply; to pulse amplifier.]

Fig. 7. The light receiver used by Galbraith and Jelley to study Cerenkov light flashes
from the night sky associated with cosmic-ray air showers.


REFERENCES


CERENKOV, P. A., 1934, Dokl. Akad. Nauk SSSR, 2, 451; 1937, Phys. Rev., 52, 378.
CHAMBERLAIN, O., and WIEGAND, C., 1956, CERN Symposium, Geneva, Vol. 2, p. 82.
FRANK, I. M., and TAMM, IG., 1937, Dokl. Akad. Nauk SSSR, 14 (3), 109.
GALBRAITH, W., and JELLEY, J. V., 1953, Nature, 171 (4347), 349.
GINSBURG, V. L., 1940, Journ. of Phys. USSR, 2, 441.
HUTCHINSON, G. W., 1961, Progress in Nuclear Physics, 8, 195 (London: Pergamon Press).
JELLEY, J. V., 1951, Proc. phys. Soc. Lond., A, 64, 82.
JELLEY, J. V., 1958, Cerenkov Radiation and its Applications (London: Pergamon Press).
JELLEY, J. V., 1959, The Journal of Planetary and Space Physics, 1, 105.
MALLET, L., 1929, C. R. Acad. Sci. Paris, 188, 445.
MARSHALL, J., 1952, Phys. Rev., 86, 685; 1954, Ann. Rev. Nuclear Sci., 4, 141.
MATHER, R. L., 1951, Phys. Rev., 84, 181.


The Author: 

Dr. J. V. Jelley (B.Sc. Birmingham, Ph.D. Cambridge) is a Principal Scientific Officer
at A.E.R.E., where he has been since 1949. Previously he had worked on airborne radar,
on the early atomic energy project (at Montreal), and on the radioactivity of light elements.
Most of his work at Harwell, where he is now studying microwave methods of polarizing
nuclei, has been on Cerenkov radiation.


Some Aspects of the Infra-red Absorption Spectra 
of Semiconductors 


by D. H. PARKINSON 
Royal Radar Establishment, Malvern 


SUMMARY 


The four chief features to be expected in the infra-red absorption
spectrum of a semiconductor are: the intrinsic edge, lattice bands, impurity
bands, and free carrier absorption. The first two of these, which have been
explored with high resolution, are discussed in detail. Measurements of
the absorption spectrum near the intrinsic edge in germanium and silicon
show that indirect transitions take place, and that ‘excitons’ are formed; and
the intrinsic energy gap can be deduced accurately. The lattice bands yield
the main features of the lattice vibration spectrum.


1. INTRODUCTION 


The last decade has seen a great development in the use of semiconductors, 
particularly germanium and silicon, as materials for electronic devices. This 
has stimulated an interest in their fundamental physics, and particularly in the 
properties of the electrons responsible for conduction processes. The arrangement
of the energy levels which these electrons can occupy is the first information
needed. This must be followed by a knowledge of the transport properties 
and how these are influenced by the crystal lattice and impurities in it. One 
approach is to investigate the infra-red absorption spectrum, using the highest 
resolving power and carrying on studies at low temperatures. This review 
deals with the nature of the information which can be extracted; it will not 
include experimental techniques. Inevitably, special reference will be made 
to germanium and silicon, for these two semiconductors have been the most 
thoroughly studied. 


2. ELEMENTARY IDEAS; BAND STRUCTURE AND ENERGY GAP 


In the atoms of germanium and silicon all the electrons are in closed shells 
except for the four valence electrons. In the solid state these elements have 
diamond crystal structures with homopolar bonding; there are four tetrahedral 
bonds to each atom, corresponding to the four valence electrons. Figure 1 
illustrates in an elementary way the behaviour of these four electrons per atom 
in the crystal. Here the electron energy is plotted vertically and the two shaded
areas $AA'$ and $BB'$ represent two bands of energy levels which are available to
the electrons.

We will first consider conditions at absolute zero. In a pure semiconductor
at absolute zero the lower band $BB'$, the valence band, contains all the electrons,
which means that all the homopolar bonds are completed. The energy gap
$AB$ is known as the ‘forbidden’ or ‘intrinsic’ gap, $\Delta E_g$; and the upper band
is the conduction band. When an electron is excited to the conduction band a
homopolar bond is broken and the excited electron is free to move through the
crystal, so taking part in conduction processes. The broken bond left behind,
which is called a ‘hole’, can also move through the crystal because an electron
from an adjacent satisfied bond can move in to fill this ‘hole’ and create another
‘hole’ nearby. In fig. 1 such holes are in the valence band, where they are
equivalent to positive charges and where they can take part in conduction
processes. The forbidden energy gap, in which there are no electron energy
levels, occurs because electrons have essentially a wave nature and, whether
they are in conduction bands or valence bands, they are immersed in the potential
field created by the atoms of the crystal lattice; this field is spatially periodic,
exactly as are the atoms through the crystal. The motion of the electrons is
mathematically analogous to the problem of the propagation of waves in a
periodically loaded transmission line (Brillouin 1946), which leads to a low-pass
band separated from a high-pass band by a stopping band. The stopping band
corresponds to the intrinsic or forbidden energy gap.

[Figure 1. Labels: electron energy; conduction electrons; conduction band ($AA'$);
donors; impurity levels; acceptors; conduction by holes; valence band ($BB'$).]

Fig. 1. The simple idea of a semiconductor.

As the temperature is raised from absolute zero, electrons can be excited 
across the intrinsic gap and at room temperature there will always be some 
electrons in the conduction band, with corresponding holes in the valence 
band. When monochromatic radiation in the visible or near infra-red is passed
through the semiconductor, the photons have enough energy to excite electrons
from the valence band to the conduction band and a high absorption coefficient,
$\alpha$, can be expected (4 to 10 × $10^3$ cm$^{-1}$). As the wavelength increases, the photon
energy ($\epsilon = h\nu = hc/\lambda$) falls and will become less than $\Delta E_g$; consequently $\alpha$ will
also fall rapidly, as shown in fig. 2, forming the ‘intrinsic edge’. At longer
wavelengths the residual absorption is due to the free carriers (electrons or holes)
left in the semiconductor at the particular temperature at which the spectrum
is measured. This absorption is only a few cm$^{-1}$ in pure semiconductors at
the shorter wavelengths, but continues to rise through to the microwave region
of the spectrum. Optical components such as lenses and prisms can be made
from pure germanium for use in the near infra-red region of the spectrum.
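
As a rough illustration of where the intrinsic edge falls, the relation $\epsilon = h\nu = hc/\lambda$ can be inverted to give the wavelength at which the photon energy equals the gap. The sketch below does this; the gap values for germanium and silicon are approximate room-temperature figures assumed here for illustration and are not quoted in the article.

```python
HC_EV_MICRON = 1.23984  # h*c expressed in ev.micron

def edge_wavelength_micron(gap_ev):
    """Wavelength at which the photon energy hc/lambda equals the energy gap."""
    return HC_EV_MICRON / gap_ev

def photon_energy_ev(wavelength_micron):
    """Photon energy hc/lambda for a wavelength given in microns."""
    return HC_EV_MICRON / wavelength_micron

# Assumed approximate room-temperature gaps (illustrative values only).
ASSUMED_GAPS_EV = {"germanium": 0.66, "silicon": 1.12}

edges = {name: edge_wavelength_micron(gap) for name, gap in ASSUMED_GAPS_EV.items()}
for name, wavelength in edges.items():
    print(f"{name}: intrinsic edge near {wavelength:.2f} micron")
```

Photons of longer wavelength than these have too little energy to cross the gap, which is why pure germanium is transparent enough further into the infra-red to serve for lenses and prisms.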

Atoms of impurities are always present, and they introduce ‘ impurity levels ’ 
in the intrinsic gap of the diagram in fig. 1. The position of these levels depends 
on the nature of the impurity atom and whether it is present substitutionally 
or interstitially. Those such as phosphorus, from Group V in the periodic 
table, have five valence electrons outside closed shells, and enter the lattice 
substitutionally. There is one electron more than is necessary for the tetrahedral
bonding; this electron can be detached easily from the parent atom and
is given to the conduction band. Consequently Group V impurities give rise
to levels near the top of the forbidden gap, and such levels are known as
‘donors’. Similarly Group III atoms, entering the lattice substitutionally,
will create one unsatisfied bond per atom. Such bonds are satisfied by electrons
from nearby completed bonds, so that holes are created. The corresponding
energy levels are known as ‘acceptors’ and lie near the bottom of the intrinsic
gap. Other impurity atoms, e.g. copper, can give rise to more than one impurity
level.


[Figure 2. Labels: intrinsic edge; envelope of spectrum; indirect transition region;
free carrier absorption; lattice bands; impurity bands.]

Fig. 2. Components of the infra-red spectrum of a semiconductor.


The presence of impurity levels will normally be seen in the absorption
spectrum. At long wavelengths, where the photon energies are much less than
$\Delta E_g$, electrons will be excited from donor levels to the conduction band and
from the valence band to acceptor levels. According to the nature of the
impurity and the amount present, absorption lines, which may be quite wide,
will occur at wavelengths ranging from 10 μ to 140 μ or beyond, and with
intensities as high as 100 cm$^{-1}$, or so low as to be difficult to detect (fig. 2).

The spectrum is further complicated because the crystal lattice itself can
absorb infra-red radiation. The normal thermal vibrations (phonons) can
induce transient electric moments on the atoms of the lattice. Photons can
interact with these and give up energy directly, through them, to the lattice.
To conserve energy and momentum, several phonons may have to be
included in the interactions (photons have energy and virtually no momentum,
whereas phonons carry momentum but have low energies in comparison with
photons in the near infra-red). Such absorption by the lattice gives rise to
‘lattice bands’ which are very widely spread and complex in structure, extending
from the near infra-red, ~10 μ, to ~100 μ or beyond. In silicon and germanium
their intensities vary from as high as 50 cm$^{-1}$ down to almost unobservable
levels; in other semiconductors such as the III-V compounds the intensities can
be considerably stronger.

There are then four distinct features of the infra-red absorption spectrum 
of a semiconductor, whether it is silicon, germanium or the more complex 
compound semiconductors such as indium antimonide: the intrinsic edge, 
lattice bands, impurity bands and free carrier absorption. The first two are 
concerned with the intrinsic properties of the semiconductor and will be
discussed in an introductory way in the next sections. Fan (1956) has given a 
comprehensive review of work earlier than that which is discussed here. 

The whole spectrum can be quite a complicated affair, as shown in fig. 2, 
with very large variations in the intensity of the absorption. Water vapour bands, 
extending throughout the whole wavelength region involved, complicate the 
measurements. Although they are very useful for calibration purposes, it is pre- 
ferable to evacuate the apparatus to remove all water vapour in the radiation path. 
The free carrier absorption is very dependent on the quantities of impurity
present, owing to the low excitation energy of many of them (~0.01 ev). Figure 2
illustrates to some extent the need to control the impurity concentration in the
experimental specimens, and research of this kind of course requires adequate
crystal growing facilities.


3. THE INTRINSIC EDGE


It should in principle be possible to derive the intrinsic energy gap,
$\Delta E_g$, directly from measurements of the intrinsic edge. However, the measured
edges vary considerably in shape, and under high resolution have revealed
detailed structure which at first impedes the deduction of $\Delta E_g$, but which can
yield a great deal of additional information.

For this discussion, we require a more complete electron energy band model 
for the semiconductor. 

The behaviour of electrons in the periodic potential field of the crystal can
be dealt with using quantum mechanics, but the mathematics is severe. A first
crude approximation is to regard the crystal potential field as a well, or box, of
sides $L_1$, $L_2$, $L_3$, with impenetrable walls, fig. 3a. The application of the
Schroedinger equation to the behaviour of electrons in such a box is dealt with
in introductions to quantum mechanics. The solutions (eigenfunctions) are
a set of plane waves of the form $\psi_{\mathbf{k}}(\mathbf{r}) = A\exp(i\mathbf{k}\cdot\mathbf{r})$, where $A$ is a constant
depending on the volume of the box, $\mathbf{r}$ is the position vector, and $\mathbf{k}$, known as
the wave vector, is such that $\hbar\mathbf{k} = \mathbf{p}$, where $\mathbf{p}$ is the momentum. The energy
corresponding to each solution is $E = \hbar^2 k^2/2m$, where $m$ is the electron mass.
The vector $\mathbf{k}$ has components

$k_x = (2\pi/L_1)\,n_1; \quad k_y = (2\pi/L_2)\,n_2; \quad k_z = (2\pi/L_3)\,n_3$    (1)

where $n_1$, $n_2$ and $n_3$ are integers ($0, \pm1, \pm2, \ldots$), and
$k = (k_x^2 + k_y^2 + k_z^2)^{1/2}$. This result leads to the concept of constructing a momentum space
based on $\mathbf{p}$, or a ‘k-space’ based on the vector $\mathbf{k}$. Using $k_x$, $k_y$, $k_z$ as the
rectangular coordinates, all values of $\mathbf{k}$ are represented by a lattice of points
corresponding to eqn. (1). For a real crystal $L$ is large and the values of $\mathbf{k}$
corresponding to solutions of the wave equation are very close together. For
practical purposes, the energy can be regarded as a continuous function of $\mathbf{k}$,
which for a given direction in k-space is a parabola as in fig. 3b. This model can
be made to fit some of the properties of electrons in the crystal by using a value
$m^*$ in place of the free electron mass, $m$; $m^*$ is an ‘effective’ mass (which can
be different from $m$ by even a factor of 10), into which is lumped the effect of all
the characteristics of the crystal ignored in the model.
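
The statement that the allowed values of $\mathbf{k}$ lie very close together for a real crystal can be made concrete with a short calculation based on eqn. (1) and $E = \hbar^2k^2/2m^*$. In the sketch below the crystal size (1 cm), the effective-mass ratio (0.1) and the function names are assumptions chosen for illustration.

```python
import math

HBAR = 1.054571817e-34          # J s
ELECTRON_MASS = 9.1093837e-31   # kg
EV = 1.602176634e-19            # J per ev

def allowed_k(n, box_side_m):
    """One component of the wave vector from eqn. (1): k = (2*pi/L) * n."""
    return 2.0 * math.pi * n / box_side_m

def energy_ev(k, effective_mass_ratio=1.0):
    """Parabolic energy E = hbar^2 k^2 / (2 m*), with m* = ratio * m, in ev."""
    return (HBAR * k) ** 2 / (2.0 * effective_mass_ratio * ELECTRON_MASS) / EV

BOX_SIDE_M = 0.01   # assumed crystal size, 1 cm

k1 = allowed_k(1, BOX_SIDE_M)
k2 = allowed_k(2, BOX_SIDE_M)
print(f"spacing of allowed k values: {k2 - k1:.3e} per metre")
print(f"energy of the n = 1 state (free mass): {energy_ev(k1):.3e} ev")
print(f"energy of the n = 1 state (m* = 0.1 m): {energy_ev(k1, 0.1):.3e} ev")
```

The energies of neighbouring states differ by amounts far smaller than any energy of spectroscopic interest, which is the sense in which $E$ may be treated as a continuous function of $\mathbf{k}$.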


[Figure 3. Panels (a) to (e).]

Fig. 3. Energy/wave-vector relations for electrons in given potential fields.


Taking the periodic crystal potential field (fig. 3c) into account, Bloch
showed that the solutions of the wave equations are of the form:

$\psi_{\mathbf{k},p}(\mathbf{r}) = U_{\mathbf{k},p}(\mathbf{r})\,\exp(i\mathbf{k}\cdot\mathbf{r})$    (2)

which is a plane wave modulated by a function $U_{\mathbf{k},p}(\mathbf{r})$, which has the periodicity
of the potential field (and hence the crystal lattice); $\mathbf{k}$ is still the wave vector,
and corresponding to each value of $\mathbf{k}$ there are a number of different solutions
each with a different energy, which are distinguished by the subscript $p$. The
energy $E$, as a function of $\mathbf{k}$, now has the character shown in fig. 3d, where
$E$ is no longer a continuous function of $\mathbf{k}$, gaps occurring in the permitted values
of $E$ at definite values of $\mathbf{k}$. Now the wave functions, eqn. (2), should not
change by translations in the crystal lattice between corresponding points in a
unit cell. If $\mathbf{a}_1$, $\mathbf{a}_2$, $\mathbf{a}_3$ are the three primitive translation vectors,

$U(\mathbf{r})\,\exp(i\mathbf{k}\cdot\mathbf{r}) = U(\mathbf{r}+\mathbf{a}_s)\,\exp[i\mathbf{k}\cdot(\mathbf{r}+\mathbf{a}_s)]$    (3)

where $s = 1, 2, 3$. Such an identity follows if $U(\mathbf{r})$ has the periodicity of the
lattice, and if $\exp(i\mathbf{k}\cdot\mathbf{a}_s) = 1$, or $k = 2\pi/a_s$. Thus the solutions are repeated if
$k$ changes by $K = 2\pi/a_s$, and all acceptable solutions of the wave equation can
be described by confining $\mathbf{k}$ to a ‘reduced zone’ such that $-\pi < k_s a_s < \pi$.
The values $k = \pi/a_s$ are also just those for which the discontinuities in $E$ occur.
Replotting fig. 3d in the reduced zone, by moving those parts of the curve
lying beyond $\pm\pi/a_s$ by multiples of $2\pi/a_s$ into the zone, gives fig. 3e. This is
the relation between $E$ and $k$ for one direction only.
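
The replotting step, in which parts of the curve lying outside the zone are moved back by multiples of $2\pi/a_s$, amounts to a simple modular reduction of $k$. A minimal one-dimensional sketch follows; the function name and the choice of including the lower boundary rather than the upper one are assumptions made here.

```python
import math

def fold_into_reduced_zone(k, lattice_spacing):
    """Shift k by a multiple of 2*pi/a so that it lies in [-pi/a, pi/a)."""
    zone_width = 2.0 * math.pi / lattice_spacing
    half_width = math.pi / lattice_spacing
    # Python's % returns a non-negative result for a positive modulus,
    # so the shifted value always falls within one zone width.
    return (k + half_width) % zone_width - half_width

a = 1.0  # lattice spacing in arbitrary units (illustrative)
for k in (0.3, 1.5 * math.pi, -2.25 * math.pi):
    print(f"k = {k:+.4f}  ->  reduced k = {fold_into_reduced_zone(k, a):+.4f}")
```

Each folded value differs from the original by a whole number of zone widths, so by the argument above it labels the same set of solutions.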

Taking all directions in k-space the number of different discrete values of k 
lying in the reduced zone is equal to the number of unit cells in the crystal and, 
as the electron spin has been neglected, two electrons can occupy each state 
represented by one $\mathbf{k}$ value. If the form of the branches, such as $AA'$ and $BB'$
in fig. 3e, is known, it is possible to calculate the density of states as a function of
$E$, integrating over all directions in k-space. It may then be found that the
highest energy states corresponding to a branch, such as $BB'$, are lower in
energy and well separated from the lowest states in the next highest band, $AA'$.
This situation is the first condition for the crystal to be a semiconductor and is
the basis for drawing the simple diagram of fig. 1. The second condition is,
obviously, that the number of states below the energy gap must be just sufficient
to contain all the electrons, so that those above are empty at the absolute zero.
$BB'$, in fig. 3e, then corresponds to the valence band and $AA'$ to the conduction
band. If there are not enough or too many electrons to fill the energy states just 
to the top of a band without filling any states in higher bands, or if the bands 
overlap, the crystal has metallic properties. 

The calculation of the functions which represent the branches of fig. 3e, 
even in the simplest crystals, is a difficult and complicated matter and can only 
be carried out by approximate methods. The chief problem is to find a satis- 
factory choice for the form of the crystal potential for each electron. Com- 
prehensive reviews of the current theoretical situation have been given by 
Herman (1958) and Callaway (1958). In real crystals the minimum in the 
conduction band and the maximum in the valence band may be at the centre of 
the zone, k =0, as is shown in fig. 4a; this is now known to be the case for 
indium antimonide and the intrinsic gap, $\Delta E_g$, is measured as shown in the
figure. In germanium the minimum in the conduction band occurs at the zone 
edge in the (111) direction, with a subsidiary minimum at the zone centre, and 
the intrinsic gap is measured as shown in fig. 4b. In silicon the minimum in the 
conduction band lies in the (100) direction and only part of the way across the 
reduced zone. In other semiconductors the maxima and minima could both be 
off centre and in different directions. For many purposes the form of the band 
functions near the maxima and minima is all that is required. 

The absorption of radiation due to interactions with electrons only can now 
be considered. The probability of the transition of an electron from a state 
wn’ o(r) to a state prc(r), (v for valence, c for conduction), can be written in 


64 D. H. Parkinson 


terms of a matrix element from which it can be shown easily, that the probability 
is non-zero if the wave vectors k and k’ are connected with q, the wave vector 
of the incident radiation, by 
k’-k+q=0 (4) 
The wave vectors are related to the electron momenta, and eqn. (4) expresses
the conservation of momentum in the transition; q is very small indeed compared
with k, i.e. photons carry very little momentum, and (4) reduces to $k' = k$.
Thus, in figures such as 4a and 4b, only vertical or ‘direct’ transitions,
e.g. AB or BA, can result from the direct interaction of radiation with electrons.
For the band structure of fig. 4a, $\Delta E_g$ should be calculable from the
position of the foot of the intrinsic edge. However, other absorption processes,
such as that due to free carriers, obscure the foot of the edge, and its position
could only be found reliably if the shape of the edge were known. A curve of the
proper shape could then be fitted at higher levels of absorption where the other
processes are relatively small. In principle the shape can be calculated by
summing the transition probabilities over the states near the maximum and
minimum of the two bands. For the simplest case, the relation between E and k is
parabolic (see Lax 1958), and two distinctly different forms of the edge then
occur. The absorption coefficient $\alpha$ is either $\propto (h\nu - \Delta E_g)^{1/2}$
or $\propto (h\nu - \Delta E_g)^{3/2}$. In the first case the transitions are said
to be ‘allowed’ and are between states for which the wave functions have
different symmetry; in the second case the transitions are said to be
‘forbidden’ (see McLean 1960). It should therefore be possible to find
$\Delta E_g$ and, from the shape of the edge, distinguish between ‘allowed’
and ‘forbidden’ transitions.

Fig. 4. Typical E/k curves for given crystal directions. (Labels: conduction
band, valence band, $\Delta E_g$; horizontal axis k.)
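
As an illustration of how the shape of the edge separates the two cases, the following minimal sketch generates model edges with exponents 1/2 and 3/2 and recovers the exponent from the slope of log α against log (hν − ΔE_g). The gap value, the energy grid and the function names are hypothetical, chosen only for the example; real data would also contain the free-carrier and exciton contributions discussed in the text, which this sketch ignores.

```python
import math


def edge_absorption(photon_energy, gap, exponent, scale=1.0):
    """Model edge: scale * (h*nu - gap)**exponent above the gap, zero below it."""
    excess = photon_energy - gap
    return scale * excess ** exponent if excess > 0.0 else 0.0


def fitted_exponent(photon_energies, absorptions, gap):
    """Least-squares slope of log(alpha) against log(h*nu - gap)."""
    xs = [math.log(e - gap) for e in photon_energies]
    ys = [math.log(a) for a in absorptions]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    return covariance / variance


GAP_EV = 0.20  # hypothetical energy gap in eV, for illustration only
energies = [GAP_EV + 0.005 * i for i in range(1, 21)]
allowed = [edge_absorption(e, GAP_EV, 0.5) for e in energies]    # 'allowed' edge
forbidden = [edge_absorption(e, GAP_EV, 1.5) for e in energies]  # 'forbidden' edge

slope_allowed = fitted_exponent(energies, allowed, GAP_EV)
slope_forbidden = fitted_exponent(energies, forbidden, GAP_EV)
print(round(slope_allowed, 3), round(slope_forbidden, 3))
```

The fit presupposes that the gap is already known; in practice the gap and the exponent would have to be adjusted together.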

However, the real situation is more complicated, and account must be taken
of the Coulomb interaction between the electron raised to the conduction band
and the hole left in the valence band. This attraction gives rise to states in
which the electron and hole are bound together as a pair, in a manner similar
to the electron-proton pair of the hydrogen atom. In such excited states the
associated pair can move as a unit through the crystal, and it is known as an
‘exciton’. The energy required to form an exciton is less than that required
to produce a free electron and a free hole. This is not a new idea: it was first
put forward by Frenkel as long ago as 1931, and has been formulated and developed
for use in semiconductors during the last few years by Dresselhaus and by Elliott;
see the review by McLean (1960). Elliott’s analysis shows that, both for allowed
and forbidden transitions, the absorption in the intrinsic edge, $h\nu > \Delta E_g$,
is much increased by exciton formation, tending to a constant value as
$h\nu \to \Delta E_g$. With allowed transitions, even at energies where
$(h\nu - \Delta E_g)$ is ~100 times greater than the exciton binding energy, the
absorption is increased by ~35 per cent. As $h\nu$ becomes less than $\Delta E_g$,
the absorption merges into the envelope of the discrete lines corresponding to the
excited states of the electron-hole pair. These lines, if resolved, would lie just
below $h\nu = \Delta E_g$ by energies given by $E(x)/n^2$, where n = 1, 2, 3, ...
and where E(x) is the exciton binding energy; for forbidden transitions
n = 2, 3, .... Experimental evidence for the existence of excitons in indium
antimonide, which has a band structure similar to that in fig. 4a, has not yet
been produced, although spectra attributed to exciton formation have been observed
in cuprous oxide and in some compounds of large energy gap (e.g. CdS and HgI₂).
It is likely that the very steep intrinsic edge found in indium antimonide is the
front of an unresolved exciton line.

Fig. 5. The direct excitation in germanium, after Macfarlane et al. (1958).
(Vertical axis: absorption coefficient $\alpha$, cm⁻¹.)

Fig. 6. Comparison of the intrinsic edges of germanium and silicon. (Separate
photon-energy scales, $h\nu$ in eV, for germanium and for silicon.)
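
The figure of ~35 per cent quoted above for allowed transitions can be checked with a short calculation. The sketch below assumes the standard hydrogenic (Elliott) enhancement factor γ e^γ / sinh γ with γ = π [E(x)/(hν − ΔE_g)]^{1/2}; this expression is not written out in the article and is brought in here only as an assumption, and the function names are hypothetical.

```python
import math


def coulomb_enhancement(excess_in_binding_units):
    """Ratio of absorption with and without electron-hole attraction
    (allowed direct transitions, assumed hydrogenic model).

    excess_in_binding_units is (h*nu - gap) divided by the exciton binding energy.
    gamma * exp(gamma) / sinh(gamma) is evaluated in the equivalent, overflow-safe
    form 2*gamma / (1 - exp(-2*gamma)).
    """
    gamma = math.pi / math.sqrt(excess_in_binding_units)
    return 2.0 * gamma / (1.0 - math.exp(-2.0 * gamma))


def relative_absorption(excess_in_binding_units):
    """Parabolic-band edge, proportional to sqrt(excess), times the enhancement."""
    return math.sqrt(excess_in_binding_units) * coulomb_enhancement(excess_in_binding_units)


# Photon energy 100 binding energies above the gap.
far_above = coulomb_enhancement(100.0)

# Approaching the gap from above, the product tends to a constant.
near_edge = [relative_absorption(x) for x in (1e-2, 1e-3, 1e-4)]
print(far_above, near_edge)
```

Under this assumption the enhancement at 100 binding energies above the gap is about 1.35, and the absorption tends to a constant value as the gap is approached, in line with the two statements in the text.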

Excitons are also formed in the direct transitions in germanium, which has
the more complicated band structure of fig. 4b. The lowest energy at which
direct or vertical transitions can occur corresponds to $\Delta E_0$, which is
considerably larger than $\Delta E_g$. Macfarlane, McLean, Quarrington and Roberts
(1958) have investigated the absorption edge in germanium, using very high
resolving power and very thin specimens (~5 to 10 μ thick), with which
measurements to absorption levels as high as 5 × 10³ cm⁻¹ can be made. They showed
that at low temperatures a previously unobserved, rather narrow peak occurs at
~0.89 eV just at the top of the absorption edge, fig. 5. This peak and most of
the attendant absorption in its vicinity are attributed to exciton formation in
the direct transition, at k = 0 in fig. 4b. The binding energy, E(x), is estimated
to be ~0.001 eV. The experimental results have been carefully analysed and, by
fitting the theoretical curves to the experimental ones at photon energies higher
than the peak, values of $\Delta E_0$ have been deduced at a series of temperatures
(from 20°K up to room temperature). Similar observations have also been
made by Zwerdling, Roth and Lax (1958), but values of $\Delta E_0$ deduced from their
results differ by as much as 0.007 eV at the lowest temperature, although agreeing
at room temperature. This has been shown to be due to strain inherent in their
method of specimen mounting.

With silicon (fig. 6) the edge does not show a similar exciton peak. The 
conduction band structure of silicon is such that at k=0 there may be a 
maximum; in this case the binding energy of the direct exciton may be negative, 
and so it will not be observed. 

The edges of germanium and silicon both show pronounced tails on the
long-wave, low-absorption side of the edge, which on detailed examination
show distinct knees, fig. 7. The explanation involves exciton formation in
indirect transitions. An indirect transition is one in which the electron is
excited by a photon while at the same time interacting with a lattice vibration,
or phonon, through which it gains or loses appreciable momentum. In this
way transitions such as XY in fig. 4b can take place, though the probability of
such a process is much less than that of a direct transition. In the wave
mechanical treatment the interaction with the phonons is dealt with by
perturbation theory and the mathematical expressions become rather cumbersome.
In principle an electron may be imagined to be elevated from a point such as X
to an ill-defined (very short duration, by the uncertainty principle) virtual state
at P by the absorption of a photon, and then immediately to make the
transition to near the minimum in the conduction band by interaction with a
phonon. Momentum and energy are conserved only in the overall process.
Equally well an electron can be imagined as raised from Q to a state near the
minimum in the conduction band, while the hole remaining at Q immediately
makes the transition to X by interaction with a phonon. The lowest frequency
at which an indirect transition can occur with the absorption of a phonon must
then be given by $h\nu = \Delta E_g - E_p$, and with the emission of a phonon by
$h\nu = \Delta E_g + E_p$, where $E_p$ is the phonon energy. Theoretical expressions
for the absorption due to both processes can be developed which appear to fit the
shape of the absorption curves taken at low resolution near the foot of the
intrinsic edges, both in germanium and silicon. This could be taken as evidence
for the indirect transition process. However, indirect transitions alone cannot
account for the presence of the knees shown in fig. 7, found at considerably
higher resolution.

With indirect transitions also, the Coulomb interaction between electron and
hole, which leads to exciton formation, has to be taken into account. The
lowest limit of photon energy for which an indirect transition can be observed
is then $\Delta E_g - E_p - E_x'$, where $E_x'$ is the exciton binding energy. From
the shapes of the components of the curves in fig. 7, and their relative positions,
Macfarlane, McLean, Quarrington and Roberts were able to show that a satisfactory
interpretation could be obtained in terms of the emission and absorption of two
different types of phonon, the energies of the phonons being the same in all
curves. Only emission processes occur at the lowest temperature, for there are
no phonons to absorb, and the shape of the two emission components could
be found. At higher temperatures, the corresponding absorption components
with similar shapes appear. From the positions on the energy scale at which
these components begin, and also by correlating the shapes of the emission and
absorption components, the phonon energies could be found. The two energies
involved with germanium were remarkably consistent in all curves, with
equivalent temperatures of 90°K and 320°K corresponding to acoustic branches of
the lattice vibration spectrum.

Fig. 7. Low level absorption spectrum in germanium at foot of intrinsic edge, showing
indirect transition region with exciton formation. The inset illustrates the accuracy with
which the curves are defined. After Macfarlane et al. (Axes: square root of
absorption coefficient, $K^{1/2}$, against photon energy $h\nu$ in eV; curves are
labelled by temperature, including 4.2°K, 20°K and 90°K, with the components
involving phonon emission and phonon absorption marked.)


The excitons formed with indirect transitions can be represented by a band
just below the minima in the conduction band in the (111) direction. For
these indirect transitions in germanium the exciton binding energy was estimated
as ~0.006 eV. A study of photoconductivity through the region shows that
the photon efficiency in producing fully separated electron-hole pairs is reduced
in those parts where excitons are formed, as would be expected. Once the
points at which the components of individual absorption curves cut the
horizontal axis are known, together with the phonon and exciton binding
energies, values of $\Delta E_g$ can be deduced. In germanium, $\Delta E_g$ falls from
~0.742 eV at the lowest temperatures (4°K) to ~0.665 eV at 290°K, i.e. the
intrinsic edge moves to longer wavelengths as the temperature rises. From
other experiments it can be shown that the minima in the conduction band
in germanium lie at the zone edge. The momenta of the phonons involved
in the indirect transition must therefore be sufficient to carry electrons
from the centre of the zone (k = 0) to the minima; hence the momenta as well as
the energies are known. In this way the branch of the lattice vibration spectrum
involved can be identified. Originally there was some disagreement with the
theoretical vibration spectra, but subsequently the lattice band analysis and
neutron scattering data have confirmed the phonon energies deduced from the
intrinsic edge analysis.
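
To make the bookkeeping concrete, the sketch below evaluates the threshold photon energies for phonon-assisted exciton formation in germanium from the figures quoted in the text (energy gap ~0.742 eV at the lowest temperatures, exciton binding energy ~0.006 eV, phonon equivalent temperatures 90°K and 320°K). The emission threshold is taken as the gap less the exciton binding energy plus the phonon energy, by analogy with the absorption limit given above; the function names and the rounded value of Boltzmann's constant are assumptions of the example.

```python
K_BOLTZMANN_EV = 8.617e-5  # Boltzmann's constant in eV per kelvin (rounded)


def phonon_energy_ev(equivalent_temperature):
    """Phonon energy from its equivalent temperature, E = kT."""
    return K_BOLTZMANN_EV * equivalent_temperature


def indirect_exciton_thresholds(gap_ev, exciton_binding_ev, equivalent_temperature):
    """Lowest photon energies for indirect transitions with exciton formation."""
    e_phonon = phonon_energy_ev(equivalent_temperature)
    return {
        "phonon_absorbed": gap_ev - exciton_binding_ev - e_phonon,
        "phonon_emitted": gap_ev - exciton_binding_ev + e_phonon,
    }


GAP_EV = 0.742              # germanium at the lowest temperatures, from the text
EXCITON_BINDING_EV = 0.006  # indirect exciton in germanium, from the text

thresholds = {
    theta: indirect_exciton_thresholds(GAP_EV, EXCITON_BINDING_EV, theta)
    for theta in (90.0, 320.0)
}
for theta, values in thresholds.items():
    print(theta, values)
```

At the lowest temperatures only the phonon-emitted thresholds are relevant, since there are no phonons to absorb; the two components then begin about 0.02 eV apart.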

In silicon there is also conclusive evidence that indirect transitions take 
place with exciton formation. In this instance there was some discrepancy
between the phonon energies originally estimated and those deduced
from the infra-red lattice bands, but this has been cleared up in a later analysis
(see McLean 1960).

The electron band structure of germanium and silicon is much more complex
than can be inferred from this discussion. Thus in fig. 4b three bands are shown
in the valence band: two are degenerate at k = 0, and the third is split
off by spin-orbit interaction and lies lower than the other two. For many
purposes the two upper valence bands are all that is required and they can be 
investigated by cyclotron resonance experiments (see Lax 1958); one band 
gives rise to virtually isotropic effective mass, but the other is much more 
complicated in its behaviour. This present article has omitted many features of 
band structure, and such matters as the selection rules involving phonons in 
indirect transitions. It has also excluded other experiments such as those 
on the pressure dependence of the intrinsic edge, and studies of the intrinsic 
recombination radiation from silicon and germanium which lend support to the 
ideas given here. 

To summarise: by a detailed study of the intrinsic edge it has been shown
that in silicon and germanium indirect transitions take place with exciton
formation; and that with direct transitions, exciton formation occurs in
germanium; $\Delta E_g$ can be deduced accurately as a function of temperature; and
$\Delta E_0$ has also been found for germanium.


4. LATTICE BANDS


The absorption of infra-red radiation in ionic crystals such as sodium 
chloride by interactions with lattice vibrations has been well established, 
although there has been some doubt about the exact nature of the absorption 
mechanism. Ideas based both on anharmonic forces and on charge deformation 
by the lattice vibrations have been suggested. In either event, because of the 
strong ionic nature of the crystals, one can see intuitively that the absorption 
should be strong. In homopolar crystals such as germanium and silicon with 
no ionic charges the lattice absorption may be expected to be, and is, weak. 
Lax and Burstein (1955) have considered the mechanism involved and have 
produced evidence for believing that in homopolar crystals the absorption is 
due to charge deformation. But first it is necessary to outline the features of the 
lattice vibration spectrum. 

The lattice of a crystal can support both transverse and longitudinal elastic
vibrations (at low frequencies these will be sound waves). In elementary
treatments, such as those leading to the Debye theory of specific heats, the
crystal is considered as a continuum. This is too crude an approximation for
the present purposes. A full analysis of the propagation of lattice vibrations,
taking into account the arrangement of the atoms and the forces between them,
gives results which are similar to those already outlined for electron waves and
is also mathematically complicated. However, the essential features of the
vibration spectrum can be found by considering a linear model, that is, a chain
of atoms, distance d apart, arranged along the x axis. The coordinate of the
nth atom will be $x_n = nd + \delta_n$, where $\delta_n$ is its displacement from the
equilibrium position. The model is worked out rigorously by Brillouin (1946).
Here the following points only are needed: $\delta_n$ can be regarded as a property
associated with the atom n which can be propagated as a wave if the problem has a
solution of the form

$$\delta_n = A \exp i(\omega t - qn) \qquad (5)$$

where A is a constant amplitude, $\omega$ is the angular frequency, t the time,
$q = 2\pi\sigma d$, and $\sigma$ is the wave number ($1/\lambda$); d and n are already
defined. The proof that eqn. (5) is satisfactory, and questions dealing with the
boundary conditions, are dealt with by Brillouin. Here, q is the change in phase
in passing from one atom, n, to the next, (n+1). From eqn. (5)

$$\delta_{n+1} = \delta_n \exp(-iq) \qquad (6)$$

and in this way q is defined as an angle. Thus the same solutions of the problem
will be obtained for $q' = q + 2m\pi$, where m is any integer, and the equations must
yield the same values of $\omega$ (or $\nu = \omega/2\pi$) for every equivalent q.
Hence $\omega$ or $\nu$ are periodic functions of q or $\sigma$. The period of q is
$2\pi$ and of $\sigma$ is 1/d. It is sufficient to discuss the modes of propagation
inside one period of $\sigma$ and the most convenient choice is usually
$-1/2d < \sigma < +1/2d$. This implies that there is a critical wavelength, 2d, and
correspondingly a critical frequency $\nu_c$, at which the vibrations are cut off.
The results can be represented by plotting the frequency against the wave number
as in fig. 8. For the monatomic linear chain transverse oscillations will be
represented by the branches OC in the figure. The branches curve in the manner
shown because the chain is dispersive (group velocity $v_g \propto d\nu/d\sigma$ and
the wave velocity $v \propto \nu/\sigma$). Longitudinal modes will travel at a
different velocity and will therefore produce a second pair of branches
such as OB.
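
The periodicity and the cut-off can be seen in a minimal sketch of the monatomic chain. It assumes nearest-neighbour harmonic forces, for which the standard result is ω = 2 (K/M)^{1/2} |sin(q/2)|; the force constant K, the mass M and the function names are hypothetical, and the article itself refers to Brillouin for the full treatment.

```python
import math


def phase_per_atom(wave_number, spacing):
    """q = 2*pi*sigma*d, the change of phase from one atom to the next."""
    return 2.0 * math.pi * wave_number * spacing


def chain_angular_frequency(q, force_constant=1.0, mass=1.0):
    """Angular frequency of a monatomic chain with nearest-neighbour springs
    (an assumed model, not taken from the article)."""
    return 2.0 * math.sqrt(force_constant / mass) * abs(math.sin(q / 2.0))


SPACING = 1.0  # interatomic distance d, arbitrary units

# The same frequency is obtained for q and q + 2*pi*m.
q_sample = 0.7
omega_sample = chain_angular_frequency(q_sample)
omega_shifted = chain_angular_frequency(q_sample + 2.0 * math.pi * 3)

# The zone edge sigma = 1/(2d) corresponds to the critical wavelength 2d
# and to the highest (cut-off) frequency.
q_edge = phase_per_atom(1.0 / (2.0 * SPACING), SPACING)
omega_cutoff = chain_angular_frequency(q_edge)
print(omega_sample, omega_shifted, omega_cutoff)
```

Near the origin the sine can be replaced by its argument, so that frequency is proportional to wave number and the chain behaves as a continuum carrying sound waves; the curvature of the branch towards the zone edge is the dispersion referred to in the text.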

These ideas can be translated into three-dimensional lattices, with similar
results, except that fig. 8 will represent the solutions in a particular direction
in the lattice and $\sigma$ is now a vector corresponding to that direction;
$\sigma$ is often called the wave-number vector or wave vector (cf. k used in the
last section). As soon as the diatomic linear chain is considered, or in three
dimensions a lattice with two atoms in the unit cell, new branches, both
transverse and longitudinal, such as PA, appear in fig. 8. As they occur at higher
frequencies, such modes of oscillation are called optical modes, whereas branches
such as OB are said to be acoustic modes. In fact, except near the origin, O, the
frequencies involved are all in the near or far infra-red. The essential
difference is that during vibration in the acoustic branches adjacent atoms move
in the same direction, while in the optical branches they move in opposite
directions. Transverse waves can be resolved in two directions at right angles to
give two independent sets of shear waves. In the most general case these two sets
will be represented by two different curves in fig. 8, but with an isotropic solid
the two transverse sets will be degenerate and represented by a single curve.

It is convenient to discuss the vibration spectrum in terms of fig. 8 and, just
as with electrons, a three-dimensional $\sigma$ space can be constructed in which
surfaces of constant frequency (or energy) are drawn. In principle, by summing
the number of modes of vibration through such a space, surface by surface, the
density of states can be calculated as a function of frequency. From such a
function of frequency it is a comparatively short step to calculate the total
energy and, by differentiation, the specific heat of the material (see Blackman
1941). Proceeding along any branch to the zone edge, the number of states in
a frequency range $d\nu$ increases and peaks appear in the diagram of the density
of states plotted as a function of frequency, which correspond to the acoustic
and optical modes very near the zone boundaries. It is customary to speak of
the energies corresponding to points such as A and B in fig. 8 in terms of the
equivalent absolute temperature (E = kT). Here the symbol k is Boltzmann’s
constant, and of course has this meaning also where kT occurs later. We must
consider the vibrations quantised as phonons, which term has already been used
and will be used freely in the following paragraphs.

Fig. 8. Frequency/wave-vector curves for the lattice vibrations in a reduced zone scheme,
showing acoustic and optical modes; the equivalent temperatures are those for silicon.
(Axes: frequency $\nu$ against wave vector $\sigma$. Equivalent temperatures marked
on the frequency axis: 734°K (at P), 695°K, 595°K, 485°K and 200°K.)
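
The conversion between an equivalent temperature, an energy and a wave number is simple arithmetic, sketched below for the two acoustic maxima quoted for silicon later in this section (200°K and 485°K). The rounded physical constants and the function names are assumptions of the example; wave numbers are given per millimetre to match the scale used for the lattice band spectrum.

```python
K_BOLTZMANN_EV = 8.617e-5  # Boltzmann's constant, eV per kelvin (rounded)
HC_EV_CM = 1.2398e-4       # Planck's constant times speed of light, eV cm (rounded)


def equivalent_energy_ev(temperature_kelvin):
    """Energy corresponding to an equivalent temperature, E = kT."""
    return K_BOLTZMANN_EV * temperature_kelvin


def wave_number_per_mm(temperature_kelvin):
    """Wave number (1/wavelength, in mm^-1) of a photon of energy kT."""
    per_cm = equivalent_energy_ev(temperature_kelvin) / HC_EV_CM
    return per_cm / 10.0


transverse_acoustic = wave_number_per_mm(200.0)
longitudinal_acoustic = wave_number_per_mm(485.0)
print(equivalent_energy_ev(200.0), transverse_acoustic)
print(equivalent_energy_ev(485.0), longitudinal_acoustic)
```

On this reckoning 200°K corresponds to roughly 0.017 eV or 14 mm⁻¹, and 485°K to roughly 0.042 eV or 34 mm⁻¹.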

The interaction between radiation and the atoms in a crystal must arise 
through an electric moment in some way. In their analysis of the mechanism 
of lattice absorption, Lax and Burstein (1955) consider the electric moment of 
the crystal and expand it in terms of the electron wave functions which depend 
on nuclear position (i.e. the non-core electrons, the ‘ deformable’ electrons). 
Then, by suitable manipulation the expressions can be converted to a form in 
which the first, second, and higher order moments are recognizable. Using 
the condition that the crystal should be electrically neutral, as in silicon or 
germanium, they show that the first order moments are zero and that processes 
involving single phonons are ruled out. Interactions with radiation through the
second moment are two phonon processes. The selection rules involved can
be found by applying the symmetry conditions for the crystal; for germanium 
and silicon inversion symmetry is required. This immediately shows that two 
phonons involved must come from distinct branches of the vibration spectrum, 
fig. 8. Multi-phonon processes involving higher order moments can also 
occur. Focussing attention on the two phonon processes, there are two distinct 
types of interaction. First, the absorption of a photon and the emission of the 
phonons, and second, the absorption of a photon and a low energy phonon and 
the emission of a high energy phonon. The first type of absorption yields 
‘summation bands’ (bands at photon energies corresponding to the sum of the 
phonon energies) and the second yields ‘ difference bands’. 

The first experimental work in which these ideas could be properly tested
was due to Johnson (1958), who examined the summation bands in silicon using
much higher resolving power than previous workers. His results are shown in
fig. 9, where the improvement in resolved detail and, in particular, the resolution
of the three-phonon bands has enabled an interpretation to be made. The
nature of the bands can be established by their temperature dependence. Thus
if phonons of wave number $\sigma_1$ and $\sigma_2$ are both emitted, the temperature
dependence will have the form $\{(1+F_1)(1+F_2) - F_1 F_2\}$, where
$F = [\exp(hc\sigma/kT) - 1]^{-1}$. With difference bands the corresponding
expression is $\{F_1(1+F_2) - F_2(1+F_1)\}$ ($\sigma_1$ being absorbed, and
$\sigma_2$ being emitted). The peaks in fig. 9 (arrow) can be interpreted
consistently in terms of three-phonon and two-phonon processes. The valleys
(circle) occur at places also expected from the two-phonon selection rules. The
characteristic temperatures assigned to the ends of the branches are shown in
fig. 8. The positions of the highest frequencies due to the two- and three-phonon
processes, P and Q in fig. 9, agreed in giving the energy of the optical modes at
$\sigma = 0$ (the Raman frequency). The whole interpretation was supported by the
close agreement between the experimental and calculated temperature dependences
and relative intensities. Although the phonon energies derived from the original
analysis of the indirect transition region of the intrinsic edge did not at first
agree with these figures, neutron scattering data produced by Brockhouse (1959)
confirmed the lattice band analysis remarkably closely. It must be emphasized that
the experimental energies for the branches at the zone edge are mean energies over
all directions in the crystal.
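
The temperature factors for summation and difference bands are easily evaluated. The sketch below takes F to be the usual phonon occupation number, [exp(hcσ/kT) − 1]⁻¹, and uses as illustrative wave numbers those corresponding to the 200°K and 485°K acoustic maxima for silicon; the rounded constants, the chosen temperatures and the function names are assumptions of the example.

```python
import math

K_BOLTZMANN_EV = 8.617e-5  # Boltzmann's constant, eV per kelvin (rounded)
HC_EV_CM = 1.2398e-4       # Planck's constant times speed of light, eV cm (rounded)


def occupation(wave_number_per_cm, temperature):
    """Phonon occupation number F = 1 / (exp(h c sigma / k T) - 1)."""
    return 1.0 / math.expm1(HC_EV_CM * wave_number_per_cm / (K_BOLTZMANN_EV * temperature))


def summation_factor(sigma_1, sigma_2, temperature):
    """Both phonons emitted: (1 + F1)(1 + F2) - F1 F2."""
    f1 = occupation(sigma_1, temperature)
    f2 = occupation(sigma_2, temperature)
    return (1.0 + f1) * (1.0 + f2) - f1 * f2


def difference_factor(sigma_absorbed, sigma_emitted, temperature):
    """Phonon 1 absorbed, phonon 2 emitted: F1 (1 + F2) - F2 (1 + F1)."""
    f1 = occupation(sigma_absorbed, temperature)
    f2 = occupation(sigma_emitted, temperature)
    return f1 * (1.0 + f2) - f2 * (1.0 + f1)


SIGMA_LOW = 139.0   # cm^-1, roughly the 200 deg K transverse acoustic maximum
SIGMA_HIGH = 337.0  # cm^-1, roughly the 485 deg K longitudinal acoustic maximum

for temperature in (20.0, 77.0, 290.0):
    print(temperature,
          summation_factor(SIGMA_LOW, SIGMA_HIGH, temperature),
          difference_factor(SIGMA_LOW, SIGMA_HIGH, temperature))
```

The summation factor reduces algebraically to 1 + F₁ + F₂ and the difference factor to F₁ − F₂, so summation bands persist at the lowest temperatures while difference bands vanish there; this contrast is what allows the two kinds of band to be told apart.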


An interesting result from this work is that the transverse acoustic modes 
have a maximum energy (200°K) which is very much smaller than that of the 
longitudinal acoustic mode (485°K). In consequence, as the crystal is warmed 
from absolute zero the transverse acoustic modes are excited first and are 
saturated before an appreciable number of other modes. Similar analyses by 
Johnson (1959) for germanium are well supported by neutron scattering data. 
The low energies associated with the acoustic modes in germanium explain the 
observed variation in the characteristic Debye temperature derived from specific 
heats. 

This work has also been extended to indium antimonide (Fray, Johnson
and Jones 1960). This material has the zinc blende (or substituted
diamond) structure, so that inversion symmetry no longer exists and the
two-phonon selection rules discussed earlier do not apply. An interesting result
of the analysis is that the transverse acoustic mode is again very well separated
from the longitudinal acoustic mode; this is likely to be so for all similar
3-5 compounds. With indium antimonide the crystal is partially ionic and
powerful reststrahl reflections occur which make the measurement of some of
the two-phonon bands near the reststrahl frequencies very difficult. The only
way of dealing with such a situation is to carry out transmission and reflection
measurements simultaneously. With partially ionic crystals, there should be
lattice absorption involving single phonons, corresponding to the region near P
in fig. 8, in the reststrahlen region. A beginning of work along these lines has
been published recently by Kleinman and Spitzer (1960).

In principle, such band analyses can be carried out for any semiconductor 
sufficiently pure, and they should yield the main features of the vibration 
spectrum. At present, the only direct way of obtaining the actual shape of the 
branches of the spectrum (fig. 8) is from neutron scattering data. The neutron 
scattering experiments are not however applicable to indium antimonide owing 
to the high neutron absorbing cross-section of indium; so the lattice band 
analysis is at present the only direct way of estimating its lattice vibration 
spectrum. 


Fig. 9. The lattice summation band spectrum of silicon. After Johnson (1959).
(Horizontal axis: wave numbers, mm⁻¹. Marked regions: 2 phonon bands, 3 phonon
bands, and a residual oxygen band obscuring 3 phonon bands.)


A model in which the outer electron charge cloud is regarded as a shell 
coupled to the nucleus and inner electrons, and also to neighbouring shells, 
has been used by Cochran (1959) for germanium. Electric fields can be induced 
in the crystal by lattice vibrations, and the mechanism for the lattice infra-red 
absorption is built into the crystal model ab initio. This model should be a very 
useful aid in extracting more information from the measured lattice bands. 


5. CONCLUSION 


Only two topics, the intrinsic edge and lattice bands, have been discussed 
in detail; but these suffice to give some insight into the amount of information 
obtainable from the infra-red absorption spectrum of a semiconductor. Not 
only is it possible to examine the properties of the electrons in the material 
fairly directly; we can also learn a great deal about the crystal lattice itself.
But these two topics are only part of the whole range of possibilities. Impurity
centres profoundly alter the electrical properties of semiconductors, and an 
accurate description of their behaviour is essential in all work on which device 
design is to be based. Combined infra-red absorption spectra and photo- 
conductivity studies can distinguish directly between the excited and fully 
ionised states of the impurity. Much work has been done already along these 
lines, see Tyler (1959) and Kohn (1957). 

Impurities can influence the lattice bands, for they introduce electric charges 
into the lattice, distort the unit cell, and in silicon and germanium upset the 
inversion symmetry. When the impurity concentration is high ($10^{18}$ or $10^{19}$
centres per cc) the distortion of the cells around an impurity must also upset the
local band structure, a change which should be reflected in the intrinsic edge
absorption. This field is relatively untouched.

Another recent study is the intrinsic edge absorption in a powerful magnetic 
field. Field-dependent structure is observed (the magneto-optical effect) which 
is allied to the resonant microwave absorption of free carriers in a magnetic 
field (cyclotron resonance), and to the absorption due to excitations from 
impurities to the conduction band, also in the presence of a magnetic field, see 
the review by Lax (1958). 


ACKNOWLEDGMENTS 


The Author wishes to acknowledge with pleasure many interesting dis- 
cussions with his colleagues, particularly Dr. F. A. Johnson and Mr. V. Roberts; 
also to thank Dr. T. P. McLean for permitting him to see his review while 
yet in typescript form. 


REFERENCES 


BLACKMAN, M., 1941, Rep. Progr. Phys., 8, 11. (London: The Physical Society.)

BRILLOUIN, L., 1946, Wave Propagation in Periodic Structures (London and New York:
McGraw-Hill).

BROCKHOUSE, B. N., and IYENGAR, P. K., 1958, Phys. Rev., 111, 747.

BROCKHOUSE, B. N., 1959, Phys. Rev. Letters, 2, 258.

CALLAWAY, J., 1958, Solid State Physics, VII, 99.

COCHRAN, W., 1959, Proc. Roy. Soc. A, 253, 260.

FAN, H. W., 1956, Rep. Progr. Phys., 21, 107.

FRAY, S. J., JOHNSON, F. A., and JONES, R., 1960, Proceedings of Prague Conference on
Semiconductors (to be published).

HERMAN, F., 1958, Rev. Mod. Phys., 30, 102.

JOHNSON, F. A., 1959, Proc. Phys. Soc., 73, 265.

JOHNSON, F. A., et al., 1960, Proc. Phys. Soc., 76, 939.

KLEINMAN, D. A., and SPITZER, W. G., 1960, Phys. Rev., 118, 110.

KOHN, W., 1957, Solid State Physics, V, 258 (New York: Academic Press).

LAX, B., 1958, Rev. Mod. Phys., 30, 122.

LAX, M., and BURSTEIN, E., 1955, Phys. Rev., 97, 39.

MACFARLANE, G. G., McLEAN, T. P., QUARRINGTON, J. E., and ROBERTS, V.,
1957, Phys. Rev., 108, 1377.
1958, Proc. Phys. Soc., 71, 863.

McLEAN, T. P., 1960, Progr. Semiconductors, V, 55. (London: Taylor and Francis.)

TYLER, W. W., 1959, J. Phys. Chem. Solids, 8, 59.

ZWERDLING, S., LAX, B., and ROTH, L. M., 1958, Phys. Rev., 109, 2207.




The Wool Industries Research Association 


“It has become obvious that the interests of science and of industry are
identical; that science cannot make a step forward without, sooner or later,
opening up new channels for industry; and on the other hand that every
advance of industry facilitates those experimental investigations upon
which the growth of science depends.” T. H. HUXLEY, 1887.


These words preface the brochure issued by the Wool Industries Research 
Association for the opening of their new laboratories at Torridon, Leeds, on 
22nd March 1961, by the Rt. Hon. R. G. Menzies, Prime Minister of Australia. 

The main objects of research at Torridon are to improve both the natural 
product and manufacturing techniques, but W.I.R.A. has a notable scientific 
tradition. In 1944 two of its staff, Dr. A. J. P. Martin and Dr. R. L. M. Synge,
were elected to the Fellowship of the Royal Society for their invention of 
chromatographic analysis and later shared the Nobel Prize for this achievement. 

The main purpose of the new building is to house pilot plants for technical 
research problems. There are also laboratories for physical testing and
general use. Some special problems being investigated are the measurement 
of friction between fibres, the use of servomechanisms and the possibilities of 
automation. 

For the Opening Day visitors a number of exhibits were set up in the block 
amongst the experimental plants. There was a demonstration of the temperature
distribution arising from “underlay” heating, in which electric heaters
were embedded in a carpet underlay. Effective heating by this process needs 
about 10 watts per square foot. 

One of the factors affecting the quality of worsted is the regularity of the 
sliver and in order to investigate this the sliver is run over a tongue and groove 
roller from which the size variations operate a system of levers like that of an 
aneroid barometer. The variations can also be measured electrically by 
causing them to alter the plate separation of a capacitor. 


PHYSICS DEPARTMENT EXHIBITS


“Static”. When wool is passed between rollers static charges are set up
(figs. 1 and 2). In the demonstration wool was charged by passing it between
metal rollers and then passed between porous rollers containing detergent,
which discharged it. It has been found that the best practical method of
minimising the static effects is to dissipate the charge rather than to attempt to
reduce or eliminate the charging, because the initial charges produced by
double layer phenomena are large and the surface densities of charge very
considerable. The three main methods of charge dissipation are: (a) conduction
through the textile to the point of contact with the textile machinery, i.e.
metal roller or guide; (b) gaseous discharge through the air to the machinery,
in which case particular attention must be paid to the disposition and shape of


76 V. T. Saunders 


the surrounding conductors; (c) the action of antistatic dressings, such as the 
detergent demonstrated. 

It was emphasised that the effect of humidity on static is entirely explained 
by the electrical conductivity of the fibres as a whole. The fact that static 
often disappears at a relative humidity of around 65 per cent is due to the 
critical conditions of fibre conductivity, speed, and geometrical arrangements 
all being satisfied at about this point. It is not due to any deposition of liquid 
water on fibre surfaces, nor to any connection between humidity and the 
intrinsic mechanism of charge separation. 


Fig. 1. Surface fibres of wool sliver charged by the passage between rollers 
during the process of spinning a yarn. 


The belief that moist air conducts electricity, and causes the failure of 
electrostatic experiments conducted in a damp atmosphere, is also erroneous. 
Leakage over solid surfaces is responsible. Air at any humidity has a dielectric
strength of about 30 kV/cm, and below this stress is a very good insulator;
gaseous discharge merely limits the maximum charge density obtainable.

Although gas discharge automatically limits the charge, a further reduction
is obtainable by using the “action of points”, that is, by arranging the conductors
in such a way that a local concentration of field is produced.




The Mechanical Properties of Wool [2]


The apparatus for studying the dynamic mechanical properties of wool over
a frequency range extending from 2 x 10-° to 1.7 cycles per second is shown in
fig. 3.


Fig. 2. Wool adhering to processing machinery due to charge acquired 
during processing at low humidities. 


One end of the 30 cm long fibre is attached to a carriage which is oscillated
sinusoidally about a vertical axis by a motor and gearbox. The plane mirror
M1, rotated by the movement of the carriage, deflects a light beam which records
the position of the carriage in a horizontal direction. The other end of the fibre




Fig. 3. Fibre cyclic stretching apparatus. 




is attached to a cantilever spring S, the deflections of which are a measure of
the load on the fibre. The deflections of the cantilever are recorded by a
galvanometer mirror, M2, which deflects the light beam vertically. This mirror
forms an image of a pinhole, placed in front of a galvanometer lamp, on the
photographic plate P. As the carriage is moved to and fro, the spot records
movement of the carriage horizontally and force vertically, forming a loop
on the plate (fig. 4).


Fig. 4. Wool fibre cyclic load-displacement-loop. 


The cantilever spring is attached to a screw-operated carriage, by means of
which the constant extension can be varied. The whole apparatus is enclosed
in a box, in which the temperature is thermostatically controlled, and constant
humidity can be maintained by means of saturated salt solutions.

Since all the experiments are carried out with the same amplitude of move- 
ment of the carriage, the height of the loop along the load axis is a measure of 
the dynamic modulus and the width of the loop along the same axis is a 
measure of the loss modulus (the ratio of the out-of-phase component of the 
stress to the strain). However, because the end of the fibre attached to the 
cantilever suffers small displacements, the strain on the fibre differs slightly 
both in phase and amplitude from the carriage movement. Corrections,
depending on both components of the force, must therefore be made to measurements
obtained from the plates.
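
The way the loop dimensions yield the two moduli can be illustrated numerically. The sketch below is an idealization, not the Association's procedure: it assumes a linear visco-elastic fibre in which the load lags or leads the sinusoidal displacement by a fixed phase angle, it reads the "width of the loop along the load axis" as the opening of the loop at zero displacement, and it ignores the cantilever correction described above. All names and values are hypothetical.

```python
import math


def loop_dimensions(amplitude, dynamic_stiffness, phase_rad, steps=3600):
    """Trace one cycle of an idealized load-displacement loop.

    Displacement is x = A sin(wt); load is F = k A sin(wt + delta).
    Returns (height, opening): the full height of the loop along the
    load axis, and the opening of the loop along the load axis at
    zero displacement.
    """
    peak_load = dynamic_stiffness * amplitude
    loads = [
        peak_load * math.sin(2.0 * math.pi * i / steps + phase_rad)
        for i in range(steps)
    ]
    height = max(loads) - min(loads)
    # The displacement is zero at wt = 0 and wt = pi.
    load_up = peak_load * math.sin(phase_rad)
    load_down = peak_load * math.sin(math.pi + phase_rad)
    opening = abs(load_up - load_down)
    return height, opening


def moduli_from_loop(height, opening, amplitude):
    """Convert loop dimensions to (dynamic, loss) stiffness per unit displacement."""
    dynamic = height / (2.0 * amplitude)
    loss = opening / (2.0 * amplitude)
    return dynamic, loss


height, opening = loop_dimensions(amplitude=1.0, dynamic_stiffness=2.0, phase_rad=0.1)
dynamic, loss = moduli_from_loop(height, opening, amplitude=1.0)
```

With a constant carriage amplitude, as in the experiments, both quantities are simply proportional to the measured height and opening, which is why the plate can be read directly.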

The results are said to suggest that the effect of temperature on the dynamic
mechanical properties of wool is mainly the result of plasticizing action. The
effect of absorbed moisture is more complicated in that it involves more modification
of the elastic elements, although the reduction of internal friction is still
the major effect. The results also show, for the loads used, that all the creep
and relaxation phenomena observed are visco-elastic and not plastic.




Colour Problems 

There was a demonstration consisting of about ten pieces of cloth dyed to 
give a range of shades in which the difference in colour between any one and its 
neighbours was insignificant but the difference between the extremes was 
considerable. A single moveable piece was available to be matched to one 
member of the series. Changes in the incident light called for marked changes 
in the matching member; it was stated that the changes which take place in the 
human eye with increasing age lead to abnormal matching. 


Fig. 5. Reduction of friction by roughening a surface. 


Radioisotopes 

The mobility of ions in films is being measured by the use of radioisotopes,
e.g. S³⁵ and C¹⁴. The Chemistry Department is also making use of radioisotopes
in the work of elucidating the structure-arrangement of the 18 amino-acids
which make up the keratin molecule.


Friction 

Finally there was the nylon-metal friction paradox (fig. 5). The cylinder on 
the left was highly polished and that on the right was roughened by sand-blasting ; 
the fibre was of nylon; the system was in equilibrium. On slightly raising or 
lowering the central weight slipping set in on the roughened cylinder before it 
set in on the other! 


REFERENCES 


[1] J. A. Medley, 1954, Journal of the Textile Inst. 15, 123.

[2] F. L. Warburton, 1959, Journal of the Textile Inst. 50, 1.
V. T. SAUNDERS 


The Proceedings of the 1960 Annual International 
Conference on High Energy Physics at Rochester, N.Y. 


The first Rochester Conference on High Energy Physics was held in 1951— 
a gathering of fifty U.S. experimental and theoretical physicists who met for 
two days in February, at the University of Rochester. The moving spirit
behind this was Professor R. E. Marshak. The atmosphere was one of a private 
club surveying the year’s happenings in experiment and theory in an easy, 
informal manner. From this rather humble beginning the Conference has 
grown, so that it has become, perhaps, the most important event in the Physics 
calendar. The first seven Conferences were held at the University of Rochester.
Then, the General Assembly of the International Union of Pure and Applied 
Physics (IUPAP) created a commission on High Energy Physics to organize 
the Conference which decided to rotate the 8th, 9th and 10th conferences among 
Western Europe (CERN), U.S.S.R. (Kiev), and U.S.A. (Rochester, 1960). 
The tenth Conference, whose report is under review, was held from
August 25th to September 1st, 1960. It was attended by 350 physicists from
30 countries. The proceedings cover 900 pages of print, as against 73 pages of
the first conference. The volume is edited by E. C. G. Sudarshan, J. H.
Tinlot and A. C. Melissinos (Interscience Publishers Inc., New York), $13.50.

The first two days were occupied by four ‘working’ sessions meeting simultaneously.
These were sessions (1) on strong interactions of pions and nucleons
(experimental) ; (2) on strong interactions of pions and nucleons (theoretical) ; 
(3) on strong interactions of strange particles and (4) on weak interactions, 
with the authors presenting their work in short communications. On the 
remaining three days the Conference met as a whole to hear and discuss reports 
prepared by rapporteurs summarizing the high-lights of the four sessions. The 
system appears complicated, but it worked with seemingly effortless smoothness. 

But somehow the 1960 Conference did not come alive. The first 
Rochester Conference the reviewer was privileged to attend was in 1955. That 
was the Chew-Low year, when the first successful theory of low-energy pion 
phenomena was presented. 1956 was the Goldberger (dispersion theory) year. 
In 1957 the Yang and Lee parity phenomena provided all the excitement ; 
in 1958 (at Geneva) the (A-V) theory of β-decay was high-lighted; and in 1959
(at Kiev) Mandelstam’s representation was the topic of the year. There was 
nothing comparable this time. 

Perhaps the most important communication to the 1960 Conference was the
proof of Mandelstam’s conjecture (presented by Eden, Polkinghorne, Taylor
and Landshoff of Cambridge University), but it is such an abstruse piece of
theory that no general excitement could be expected. The one really new
experimental contribution, the discovery of the first πΛ resonance, came almost
surreptitiously at the last minute. There were no Rochester lions this year.

The publication of the report two months after the meeting is a remarkable 
feat for which the Editors deserve every congratulation. It is a compendium 
of all that happened in High Energy Physics during 1959-1960, whether it was 
exciting in the sense I have talked about, or not. The rapporteurs’ summaries 
are a pleasure to read. The Swiss printing is magnificent. Unquestionably
everyone seriously interested in High Energy Physics must acquire a copy of
this report. 


A. SALAM. 


Clean Air 


by S. H. G. STEVENS 


1. INTRODUCTION 


The Clean Air Act 1956 aims at the minimizing of atmospheric pollution 
from the products of combustion arising from the inefficient burning of fuel, 
particularly bituminous coal. This article deals with the cause and effect of the 
pollution of the atmosphere from these sources. 

During the 19th century, developing industry brought various health 
hazards. Those affecting the water supply and sewage disposal of an urban
population were attended to. But air pollution was not. 

The pollutants are, first, the particulate matter heavy enough to fall out of 
the atmosphere; next the smoke particles suspended in the atmosphere for 
lengthy periods and, finally, the sulphur acids derived from the burning of any 
fuel containing sulphur. The DSIR publication “Measurement of Air
Pollution” discusses them in detail, and figures quoted in this article are based
upon information from the Fuel Research Station. 

In order to obtain a representative sample of deposited matter, a collecting 
jar is strategically placed and left open for one month. 

Air is filtered through a section of a filter paper at specific intervals, darkening 
the filter; this is compared with a standardized colour chart and the degree of 
suspended impurity estimated. 


2. MEASUREMENT OF SULPHUR DIOXIDE 


Wherever fuels containing sulphur are burnt, whether efficiently or not, 
sulphur dioxide will be produced. 
There are two methods by which the presence of sulphur oxides may be
determined:


(a) The Lead Peroxide Method 

A porcelain cylinder having a surface area of 10 cm², coated with standard
lead peroxide mixed with gum mucilage, is exposed to the atmosphere for one 
month. 

The reaction with sulphur dioxide is: 

$$\mathrm{PbO_2 + SO_2 = PbSO_4}$$

The paste is removed from the fabric, and after acidifying with hydrochloric
acid, the sulphate is precipitated with barium chloride solution.




(b) The Volumetric Method 
Air is bubbled through diluted hydrogen peroxide, with the following
reaction:
$$\mathrm{H_2O_2 + SO_2 = H_2SO_4}$$
The acidity of the resulting solution is determined by titration with alkali
and used to calculate the equivalent sulphur dioxide concentration in the air,
which is very much greater than concentrations of other strong acids or alkalis.
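
The arithmetic behind the volumetric method can be sketched as follows. This is an illustration under stated assumptions, not the DSIR procedure: all the acidity is attributed to sulphuric acid formed from sulphur dioxide, the alkali strength is expressed as a normality, and the function name, sample volumes and units are hypothetical.

```python
SO2_MOLAR_MASS_G = 64.06  # grams per mole of sulphur dioxide


def so2_concentration_ug_per_m3(alkali_volume_ml, alkali_normality, air_volume_m3):
    """Estimate the SO2 concentration in the sampled air from a titration.

    Each mole of SO2 absorbed gives one mole of H2SO4, which supplies
    two equivalents of acid, so moles of SO2 = equivalents of alkali / 2.
    """
    acid_equivalents = alkali_volume_ml / 1000.0 * alkali_normality
    moles_so2 = acid_equivalents / 2.0
    mass_ug = moles_so2 * SO2_MOLAR_MASS_G * 1e6
    return mass_ug / air_volume_m3


# Hypothetical sample: 1.0 ml of 0.004 N alkali neutralizes the acid
# collected from 2 cubic metres of air.
example = so2_concentration_ug_per_m3(1.0, 0.004, 2.0)
```

For these assumed figures the result is about 64 micrograms of sulphur dioxide per cubic metre of air.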


3. THE CAUSES OF SMOKE 

(a) The Domestic Grate 

Coal at 500°F loses its moisture and the combustibles begin to oxidize.
Around 700°F tarry and volatile hydrocarbons are evolved as a yellowish
brown smoke when unignited. This is apparent when fresh coal is added to the
domestic fire. In the early stages the smoke passes up the flue but with rising
temperatures the coal begins to glow at 930°F, proceeding to a bright red at
1550°F, when the tarry hydrocarbons ignite, giving fairly smokeless combustion
until the next stoking-up. During the period of bright redness, radiant heat
may be enjoyed; there are, however, long periods during which there is only
smoke.


(b) The Steam Boiler Plant 


In the steam boiler the firebed and furnace temperature is in the range
2000-2500°F, well above the ignition temperatures of the tarry hydrocarbons.

At 1750°F all of the hydrocarbons will have been evolved and if the principles 
of complete combustion have been followed, they will burn smokelessly. 

Thus, after the evolution of the tarry volatiles come a variety of compounds
of hydrogen and carbon of which methane, CH₄, is the most important. This
hydrocarbon burns at 1202°F; it has a very high heat release, and if it passes
into the boiler flues unburnt the result is heat loss and smoke.


For smokeless combustion, then:
(i) There must be time to complete combustion in the hottest zone of the
furnace chamber, for hydrocarbons are evolved rapidly and are easily
chilled by the water-cooled surfaces over or through which they pass,
giving a soot-blackened smoke.
(ii) The hydrocarbons must be kept above their ignition temperatures.
(iii) The mass of gas produced in the first few minutes after firing must
receive ample oxygen to burn the combustibles.


4, THE PROCESS OF COMBUSTION 


During combustion the following weights of flue gases are evolved per pound 
of combustible element: 


3.66 lb of carbon dioxide from the reaction of carbon with oxygen.
9 lb of water vapour from the reaction of hydrogen with oxygen. 
2 lb of sulphur dioxide from the reaction of sulphur with oxygen. 
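
These weights follow from the atomic masses of the elements concerned. The short sketch below reproduces them using rounded atomic masses (C = 12, H = 1, O = 16, S = 32); the function and variable names are illustrative only.

```python
ATOMIC_MASS = {"C": 12.0, "H": 1.0, "O": 16.0, "S": 32.0}


def product_lb_per_lb(element, n_element, n_oxygen):
    """Pounds of oxide formed per pound of element burnt.

    The product is taken to contain n_element atoms of the element
    and n_oxygen atoms of oxygen.
    """
    element_mass = n_element * ATOMIC_MASS[element]
    product_mass = element_mass + n_oxygen * ATOMIC_MASS["O"]
    return product_mass / element_mass


co2_per_lb_carbon = product_lb_per_lb("C", 1, 2)    # CO2: 44 / 12
h2o_per_lb_hydrogen = product_lb_per_lb("H", 2, 1)  # H2O: 18 / 2
so2_per_lb_sulphur = product_lb_per_lb("S", 1, 2)   # SO2: 64 / 32
```

The carbon figure works out at 44/12, or about 3.67 lb, which the list above gives as 3.66 lb; the hydrogen and sulphur figures are exactly 9 lb and 2 lb.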




The sulphur dioxide reacts under favourable conditions with the water 
vapour in the chimney flues and forms sulphurous acid. Further, sulphur
trioxide may be formed to give sulphuric acid. Similar reactions take place 
when these sulphur oxides are emitted into the atmosphere. The hydrocarbons 
when burnt completely produce carbon dioxide and water. If the hydrocarbons 
which have a high calorific value (table 1) are incompletely burnt or are chilled 
and pass into the atmosphere as smoke, much of this heat is lost. 

If we accept the fact that volatile hydrocarbons are the potential smoke 
producers, then in order to prevent the smoke from being formed and emitted 
into the atmosphere these gases must be burnt within the furnace chamber. 


Calorific Values

Fuel                                 Heat Release,
                                     B.Th.U. per lb
Hydrogen                             61 340
Carbon burnt to carbon monoxide       4 350
Carbon burnt to carbon dioxide       14 590
Methane CH₄                          23 860
Acetylene C₂H₂                       21 590
[illegible]                          71 380
Ethane C₂H₆                          22 220

Remarks. These calorific values are gross. The net heating value is defined
in the report of the Institute of Civil Engineers Heat Engines Trials Committee
as the gross calorific value less the latent heat (reckoned at 1,055 B.Th.U.
per lb) of the water condensed from the products of combustion by cooling
them to 60°F. The smoke from unburnt hydrocarbons represents considerable
heat loss; the incomplete combustion of carbon to carbon monoxide represents
a heat loss of 10 240 B.Th.U. per lb of carbon (coke).


Table 1. Calorific values of hydrocarbons 
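
The two calculations mentioned in the remarks to table 1 can be checked with a few lines of Python. This is a sketch under stated assumptions: the quantities of water formed per pound of fuel (9 lb for hydrogen, 2.25 lb for methane) are derived here from rounded atomic masses and are not quoted in the table, and the names used are hypothetical.

```python
LATENT_HEAT_BTU_PER_LB_WATER = 1055  # as reckoned in the remarks to table 1

# Gross calorific values from table 1, in B.Th.U. per lb.
GROSS_BTU_PER_LB = {
    "hydrogen": 61340,
    "carbon_to_monoxide": 4350,
    "carbon_to_dioxide": 14590,
    "methane": 23860,
}


def net_calorific_value(gross_btu_per_lb, water_lb_per_lb_fuel):
    """Gross value less the latent heat of the water condensed."""
    return gross_btu_per_lb - LATENT_HEAT_BTU_PER_LB_WATER * water_lb_per_lb_fuel


def carbon_monoxide_heat_loss():
    """Heat lost per lb of carbon burnt only as far as carbon monoxide."""
    return GROSS_BTU_PER_LB["carbon_to_dioxide"] - GROSS_BTU_PER_LB["carbon_to_monoxide"]


net_hydrogen = net_calorific_value(GROSS_BTU_PER_LB["hydrogen"], 9.0)
net_methane = net_calorific_value(GROSS_BTU_PER_LB["methane"], 2.25)
co_loss = carbon_monoxide_heat_loss()
```

The difference between the two carbon entries reproduces the 10 240 B.Th.U. per lb quoted in the remarks.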


5. THE CLEAN AIR ACT, 1956

Smoke 

The first section prohibits the emission of dark smoke from chimneys. 
Industrial chimneys are controlled specifically by The Dark Smoke (Permitted 
Periods) Regulations, 1958. This section includes railway engines and ships. 

The ‘permitted periods’ give some latitude in the emission of dark smoke
when lighting up from cold and during soot blowing, according to the number of
furnaces in an industrial plant and to the types of boilers on ships. The continuous
emission of ‘dark smoke’ for more than four minutes, or of ‘black
smoke’ for more than two minutes in any thirty minutes from the industrial
chimney and for more than three minutes in any thirty minutes from the
chimney of a vessel is an offence.

The density of smoke is measured against the shade cards of the Ringelmann 
Chart (fig. 1). 

The Act defines ‘dark smoke’ to be as dark as, or darker than, Shade 2 on
the chart. ‘Black smoke’ is defined as dark as, or darker than, Shade 4.
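
The shade thresholds and the industrial time limits described above can be expressed as a simple check. The sketch below is only one reading of the summary given in this article, not of the Regulations themselves: it assumes one Ringelmann reading per minute over a thirty-minute period, treats the dark-smoke limit as applying to a continuous run and the black-smoke limit as a total, and uses hypothetical function names.

```python
def classify_shade(shade):
    """Classify a Ringelmann shade number (0 to 5) by the Act's definitions."""
    if shade >= 4:
        return "black"
    if shade >= 2:
        return "dark"
    return "permitted"


def industrial_offence(shades_per_minute):
    """Check one thirty-minute period of per-minute readings.

    Returns True if dark smoke (shade 2 or darker) is emitted
    continuously for more than four minutes, or black smoke
    (shade 4 or darker) for more than two minutes in total.
    """
    longest_dark_run = 0
    current_run = 0
    black_minutes = 0
    for shade in shades_per_minute:
        if shade >= 2:
            current_run += 1
            longest_dark_run = max(longest_dark_run, current_run)
        else:
            current_run = 0
        if shade >= 4:
            black_minutes += 1
    return longest_dark_run > 4 or black_minutes > 2
```

For example, five consecutive minutes at Shade 2 would count as an offence under this reading, whereas four minutes would not.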




An alternative method of assessing smoke density uses a photocell (fig. 2).
When the smoke exceeds a certain density an alarm operates.
There is a prohibition on the installation of new furnaces unless they can
operate smokelessly.


No. 1    20% black, or light grey
No. 2    40% black, or darker grey
No. 3    60% black, or very dark grey
No. 4    80% black, or black
No. 5    100% black


Fig. 1. The Ringelmann Chart. This is observed from a distance of 50 feet against the 
background of the smoke emission that is being tested. 


Section 10 is unique as it includes the sole reference in the Act to gases as
well as smoke, grit, and dust in relation to the height of chimneys. This section
seeks dispersal of such emissions by adequate chimney height, considering the
proximity of other buildings, the topography of the district, town planning and
other relevant matters so that the public health may not be endangered or
nuisance created. The problem is largely one of safe dispersal of sulphur
dioxide and reducing its ultimate concentration at ground level.




Smoke Control Areas and Domestic Smoke are dealt with in Sections 11 to
15 of the Act and aim at the control of smoke within specified areas, emphasizing
the installation of domestic grates designed for smokeless fuels. Experience
shows that contemporary designs of buildings frequently provide for dwarf
chimneys in the interests of symmetry, with little apparent regard to the objects
of this section.


[Diagram labels: smoke; chimney; photo-electric cell; projector; alarm bell or light; control unit and photocell amplifier; indicating meter; recorder; mains connection; alarm supply connection.]


Fig. 2. Internal smoke density meter, fitted to chimney stack. 


6. THE EXTENT OF ATMOSPHERIC POLLUTION 


As an extreme example of actual damage to stonework one may select the 
deterioration of the masonry in the colleges and other buildings of the University 
of Oxford (fig. 3). It has been said that the damage to these buildings has been 
greater during the past 50 years than in the previous 500 years. 

The owner of the suburban home also realizes that decoration and renovation
is a major item in the maintenance of house property. The Beaver Report
under the heading “The Economic Cost of Air Pollution” gives the following
approximate annual dirt bill: laundry and household goods £25 m; painting




and decorating £30m; cleaning and depreciation of buildings other than 
houses £20 m; corrosion of metals £25 m; damage to textiles and other goods 
£52 m; and the overall cost of air pollution is £150 m. 

In addition to these figures a further £100 m per annum is estimated as
representing ‘indirect costs’ such as occur from ill-health, giving a total figure
of £250 m per annum. This sum does not include waste of fuel due to incomplete
combustion, estimated at £50 m per annum.
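
The figures quoted above can be tallied as a check on the arithmetic. The sketch below uses only the amounts given in this article, in millions of pounds per annum; the variable names are illustrative. It shows that the itemized direct costs add up to £152 m, which the article gives approximately as £150 m.

```python
# Direct costs quoted from the Beaver Report, in millions of pounds per annum.
DIRECT_COSTS = {
    "laundry and household goods": 25,
    "painting and decorating": 30,
    "cleaning and depreciation of buildings other than houses": 20,
    "corrosion of metals": 25,
    "damage to textiles and other goods": 52,
}

itemized_direct_total = sum(DIRECT_COSTS.values())

STATED_DIRECT_TOTAL = 150   # overall cost as stated in the article
INDIRECT_COSTS = 100        # ill-health and similar indirect costs
FUEL_WASTE = 50             # incomplete combustion, excluded from the total

stated_grand_total = STATED_DIRECT_TOTAL + INDIRECT_COSTS
total_including_fuel_waste = stated_grand_total + FUEL_WASTE
```

Adding the separately estimated waste of fuel to the stated total of £250 m would bring the figure to £300 m per annum.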


Fig. 3. Atmosphere damage; from outside the Sheldonian Theatre, Oxford. 
[Copyright Peter Bowles, Oxford 


7. SMOKELESS FUELS 


Nothing in the Clean Air Act prohibits the burning of bituminous coal with 
a volatile content of 20 per cent and over; but such fuel on the ordinary type of 
domestic grate will produce smoke (fig. 4), hence those who live in Smoke 
Control Areas (fig. 5) must use Smokeless Fuels. These include gas, electricity 
and solid fuels with less than 20 per cent volatile content as may be found in 
domestic coke and low temperature carbonized cokes as well as anthracite and 
Welsh and Scottish steam coals. While the carbonized cokes contain some 
sulphur the process of converting coal to coke leaves behind about half of the
sulphur oxides in the gas-works, so the percentage of sulphur in coke is about
1-2 per cent.




Fuel                       Ash        Moisture   Volatile Matter   Fixed      Calorific Value,
                           per cent   per cent   (Hydrocarbon)     Carbon     Dry Ash Free
                                                 per cent          per cent

Non-Smokeless
  High Volatile Coal
    (Bituminous)           6-13       0.6-13.5   30.1-37           + 60       14 100-15 900

Smokeless
  Anthracite               5.4        1.5-3.0    4.5-9.5           86         15 250-15 600
  Low Volatile
    Steam Coal             5          0.7-3.0    9.6-20            80         15 600-15 950
  Coke                     7-12       —          0.5-2.0           86-90      14 400
  Low Temperature
    Carbonized Fuel        5-10       —          6-12              80         15 000
  Fuel Oils                —          —          98-100            87-90      18 000-19 500


Table 2. Analysis of coal and smokeless fuels


Fig. 4. Domestic smoke. Colwyn Bay on a winter’s day.
[Copyright Ronald Thomson, Denbigh.




Oil fuels may or may not be smokeless in use. Kerosene is usually burnt 
by means of a wick or in the perforated drum type of vaporizer; it is used in the 
domestic convector heaters in which combustion is, normally, smokeless. The 
products of combustion pass into the room from which they may be removed 
by natural ventilation. 

The gas and diesel oils used for domestic hot water and central heating boilers
are also vaporized and burnt by fan-assisted draught. These may or may not
operate smokelessly but the products of combustion pass into a chimney flue.

In the larger domestic installations, central heating systems and oil-fired 
boiler plant, the oil is atomized by steam or oil under pressure incorporated in 
the burner design. Smokeless combustion of oil under these conditions depends 
entirely upon efficiency of operation. Smoke is very easily produced by the 
burning of oil by reason of its high (98 per cent) hydrocarbon content. The
oil companies are producing oils from which the sulphur is removed during 
refining and this is an important step forward in the abatement of atmospheric 
pollution. 


8. THE DESIGN OF THE DOMESTIC GRATE FOR THE COMBUSTION OF SMOKELESS
FUEL IN SMOKE CONTROL AREAS


It needs about 10 lb of air to burn 1 lb of coal for every 10 000 B.Th.U.
in each 1 lb of coal; and usually about half the air reacts with the carbon, and about
half burns the hydrocarbons to carbon dioxide and water. The carbon on
the grate needs air admitted under the grate bars passing upwards through the
carbon or coke. The hydrocarbons require an air supply over the grate.
Many of the old-fashioned Victorian grates, as well as the familiar basket and 
fret have very little effective air control either under or above the grate, a feature 
that is rectified in modern grates designed to burn smokeless fuels. In the latter 
the grate stands higher from hearth level, the grate bars are widely spaced, all 
of which aims at more air under the grate. The front of the grate should be a 
sealed fixture, so that the air can be directed. Smokeless fuels contain much 


Fig. 5. Industrial smokelessness. In the Manchester smokeless zone. 


[Reproduced by kind permission of Elsam, Mann & Cooper (Manchester) Ltd. 




less hydrocarbon than raw bituminous coal, thus the air supply over the grate 
is of less moment to efficient combustion. A reduction of the air flow over the 
fire can be obtained with advantage and may be achieved by restricting the throat 
of the chimney flue from the average of 9 in. by 9 in. down to 9 in. by 4 in.;
this reduces change of air in the room from about 7 to 8 changes per hour down
to 2 to 3 changes which is generally sufficient for ventilation; draughts are
reduced and the room becomes warmer. There are other improvements in
the modern grate, but the basic principle remains the control of air supply
required for combustion.
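
The rule of thumb for the air supply, and the effect of restricting the chimney throat, can be put into figures. The sketch below is an illustration only: the calorific value of 14 100 B.Th.U. per lb is taken from the lower end of the range for bituminous coal in table 2, the equal division of air under and over the grate follows the approximate statement above, and the function names are hypothetical.

```python
def air_required_lb(calorific_value_btu_per_lb, coal_lb=1.0):
    """About 10 lb of air per lb of coal for every 10 000 B.Th.U. per lb."""
    return 10.0 * calorific_value_btu_per_lb / 10000.0 * coal_lb


def split_air(total_air_lb):
    """Divide the air roughly equally: under the grate (for the carbon)
    and over the grate (for the hydrocarbons)."""
    half = total_air_lb / 2.0
    return half, half


def throat_area_sq_in(width_in, depth_in):
    """Cross-sectional area of a rectangular chimney throat."""
    return width_in * depth_in


total_air = air_required_lb(14100)
under_grate, over_grate = split_air(total_air)
area_ratio = throat_area_sq_in(9, 4) / throat_area_sq_in(9, 9)
```

On these assumptions 1 lb of such coal needs about 14 lb of air, and the restricted throat has a little under half the area of the unrestricted one. The fall in air changes quoted above need not be strictly proportional to this ratio.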




Fig. 6. A modern smokeless-fuel grate. Note the deep ash-pan space giving ample air
beneath; the restricted chimney throat; and the deep fuel bed essential for
proper combustion.




9. TABLE OF STATUTES WITH SHORT BIBLIOGRAPHY 


STATUTES 
ATMOSPHERIC POLLUTION—GENERAL 

Public Health Act, 1936—Part III. Abatement of Statutory Nuisances which is a 
general duty of local authorities. 

The Clean Air Act, 1956. Makes provision for abating the pollution of the air, and 
while it is the duty of the local authority to enforce the provisions of the Act, the 
declaration of smoke control areas is permissive. 

The Dark Smoke (Permitted Periods) Regulations, 1958. Allows by regulation periods 
for the emission of dark and black smoke from chimneys other than the chimneys of 
vessels which are controlled by The Dark Smoke (Permitted Periods) (Vessels) 
Regulations, 1958. 

The Alkali, etc., Works Order, 1958. Made under The Alkali, etc., Works Regulation
Act, 1906. This controls the discharge of noxious and offensive gases, smoke,
grit and dust from industries which have special and difficult problems to solve
regarding fumes and dusts from industrial processes. For example, the iron and steel
industry employs processes in which large volumes of iron oxide fume-laden gas are
produced at high temperatures; electricity works using pulverized fuel have fly
ash to cope with; gas and coke works, the ceramics industry and others are controlled
through the alkali inspectorate appointed under the principal act.


SMOKE CONTROL AREAS 


Smoke Control Areas (Authorized Fuels) Regulations, 1956. Enacts that anthracite, 
briquetted fuels carbonized in the process of manufacture, coke, electricity, gas, low 
temperature carbonization fuels and low volatile steam coals are the authorized fuels. 


The Smoke Control Areas (Exempted Fireplaces) Order, 1957. Allows liquid fuel to be 
used in Smoke Control areas provided the fuel burning equipment is so installed, 
maintained and operated as to minimize the emission of smoke. 


The Smoke Control Areas (Exempted Fireplaces) Order, 1959. Allows the burning of 
fuel other than authorized fuels in well-defined types of furnaces, notably those in 
which mechanical stokers are most suited. 


REFERENCES 


D.S.I.R., 1957, Measurement of Air Pollution. 

D.S.I.R., 29th Report on The Investigation of Atmospheric Pollution. 

H.M.S.O., 1954, Committee on Air Pollution, pages 49, 52. 

Smokeless Air, Journal of the National Society for Clean Air, Summer, 1958, p. 277.
Climatological Aspects of Air Pollution, by R. S. Veryard, B.Sc., Meteorological
Office.

The Iron and Steel Institute. The Fulham-Simon-Carves Process for the Recovery of
Sulphur from Flue Gases. Special Report No. 61.


The Author: 


Is Deputy Chief Public Health Inspector to the Borough of Heston and Isleworth. 
Member of the Association of Public Health Inspectors. Lecturer at the Kingston 
Evening Institute for the Smoke Inspector’s Certificate of the Royal Society of Health. 


Colour and Perception : the Work of Edwin Land in the Light 
of Current Concepts. 


by M. H. WILSON and R. W. BROCKLEBANK 
(Goethean Science Foundation, Clent, Stourbridge, Worcs.) 


1. INTRODUCTION 


In November 1957 the New York daily papers carried reports of an ‘ Optical 
Mystery ’, discovered by Edwin H. Land, associated with Polaroid and the 
one-minute camera. ‘The accounts described how two black-and-white pictures, 
photographed from the same scene but through red and green filters, and 
projected with red light and white light, respectively, were superimposed on the 
viewing screen to give a picture in full colour—a blonde in a green sweater. 
No explanation was offered, either by Dr. Land or by others present. Land 
was even quoted as saying that conventional theories could not explain his 
results, which were in specific conflict with current explanations of light and how 
we see colour. The reports mentioned that the “ entire range of colors was 
produced, from blue and yellow to green, in addition to the expected pink 
shades ”’. 

Such demonstrations could have been explained in terms of current conventional
theories. They seemed to be merely elaborate photographic versions
of the old experiment with the coloured shadows (Rumford 1794, Goethe 1810) 
in which vivid blue-green colours are perceived in a setting lit only with red and 
white lights. It was true that current theories had not been able to explain to 
everybody’s satisfaction just how such contrast effects come about—there was 
some doubt as to the relative importance of chemical balance between the photo- 
pigments in the retina, and the neural connexions between the receptors and 
the centres of consciousness—yet the effects were well known and there was no 
shortage of attempts to provide a model of the visual mechanism that would 
explain them all. 

When Land came to England, in May 1958, and showed his demonstrations 
to the Royal Photographic Society, he put forward his own explanation. The 
eye, he said, receives two (possibly more) geometrically identical but separate 
images of the scene, one in terms of longer wavelengths, the other in terms of 
shorter; each of these records is subjected, independently, to a normalizing 
process, and after all systematic differences have been removed, the remaining 
random differences are seen as colour. 

This theory closely parallels Land’s experimental procedure, taken in reverse. 
The random differences of colour in the original scene are selectively distin- 
guished by the red and green filters used in taking the pictures, all other differ- 
ences between the two projected images depending on the illumination of the 
original scene, the photographic contrast in developing the print, the brightness 
of the projecting lamps, and so on, which might be described as systematic 
rather than random. The two images are projected separately with lights that 
differ in spectral content, such that one could be said to contain essentially 




longer wavelengths than the other. Thus the theory is quite a good abstract 
summary of the demonstration, but it could hardly stand as a complete theory 
of colour vision, which would have to account for all the known phenomena of 
vision, including the results of colour mixing and matching upon which the 
trichromatic theory is based. 

When Land stated his case in print (Land 1959 a, b), there came a flood of 
articles announcing the overthrow of classical laws of colour. For instance: 
“Every textbook dealing with color will have to be rewritten.” (Bello 1959.) 
But since this publicity in popular journals brought a considerable reaction
from scientific quarters, Land has moderated his claims (Land 1959 c). In fact,
Land’s first account (1959 a) describes a series of logically connected experiments
which produced some previously unrecorded results.


2. LAND’S EXPERIMENTS


In the first part of his paper (1959 a), Land describes twenty-two experiments 
done with a dual projector which can project two photographic transparencies 
simultaneously in register. Each beam from the incandescent tungsten source 
may be controlled independently for intensity by polarizers and for colour by 
filters. The transparencies were made with a beam-splitting camera fitted with 
a red and a green filter, so that each subject was recorded in a red separation 
and in a green separation. ‘The experiments are numbered as follows: 


1. Red and white light from the projectors, without slides, colour the screen red or pink 
or white as the intensities are altered. 

2. The two separation positives, projected without filters, produce black-and-white
images differing in lightness for respective areas. 

3. When the red separation is projected in red light, and the other added in white light, 
the picture (a portrait) ‘instantaneously . . . appears in full color: blonde hair, pale 
blue eyes, red coat, blue-green collar, and strikingly natural flesh tones’. 

4, 5. If the room lights are turned on to flood the screen with light, and then snapped off,
there is no time lapse before ‘the appearance of full color in the picture,’ nor do the
colours change if the room lights are brought up to a medium level so that the surroundings
in the room, lit by dim white light, can be seen at the same time.

6. In a desk scene, projected with red and white lights, white objects (a pencil, cigarettes,
etc.) are distinguished from yellow (a pencil) and from pink (areas on a map), the 
green-blue colour of a large book is contrasted with the deep-blue stripe on an air-mail 
letter, and with the pale blue area of ocean on the map, while brown, green, black and 
red objects are also distinguished. 

7, 8. Another portrait with auburn red hair and a green sweater is used to show that these
colours do not depend on the size of the image as measured by the angular subtense
at the observer’s eye, nor do they depend on what the observer expects to see, as is
evident when the colour filters in the projectors are reversed, giving a reddish sweater,
greenish hair, and intense blue-green lips.

9. A picture of a dozen objects set against a grey background is used to show that obser- 
vers (unless colour-blind) show marked consistency in naming the colours of the objects, 
including red, grey and dark blue, even though they would not know what colours 
the objects might be (except an orange). 

10. When the intensity of the two beams is varied independently, the colour names 
reported remain the same over a wide range of red- to white-light ratio, and if colour 
appears at all, it does so for all objects in the picture. 

11. With the same picture of objects on a grey field, the green separation is augmented 
by a superimposed duplicate, thus doubling the densities and contrast relative to the 
red separation; when projected with red and white light, respectively, for the red and 
the double-green separations, the objects retain their original colour names. 




12. Pictures of grocery packages and fruit are projected with various pairs of colours to 
show that objects retain their colour identity, even though the colour of the background 
may change slightly. Kodak Wratten filters are used as follows: 


               Red Separation.                Green Separation. 

               No. 24 Orange Red.             No filter. 
               No. 24 Orange Red.             No. 58 Green. 
13. Same:      No. 70 Extreme Red.            No. 73 Yellow, narrow band. 
14. Same:      No. 73 Yellow, narrow band.    No. 49B Blue Green. 
15. Same:      No. 47B Blue.                  No filter. 


Spectral transmission diagrams, plotting density against wavelength, are reproduced 
for most of these filters. 


16. It is pointed out that the yellow No. 73 could be used with either slide of the pair of 
separations. 


The remaining six experiments are intended to show situations where the 
objects are not seen in the colours that Land had come to expect, even though 
many of the same ratios of red to white light are present in the pattern on the 
screen. 


17. Three identical black-and-white transparencies are used, two superimposed in one 
projector and one in the other projector, and projected with red and green filters; 
the resulting picture is ‘ uniform in color and nearly colorless.’ 

18. When a green separation is projected in white light, and red light from the other 
projector is added without a photographic slide, then whatever the intensity of the 
red light, ‘ there appears only a uniform wash of the colors in their classical mixtures ’. 

19. With two photographic step-wedges, one placed horizontally in one projector, and the 
other vertically in the other, so that there are 256 little squares on the screen, each 
with a different ratio of lights from the two projectors and arranged in a topographically 
regular array, ‘ the colors mix only in a classical sense, red and white giving a vague 
pink, and red and green showing some red, some green, and some yellow’. 

20. The same desk scene used in Experiment 6 is projected with the two images far out 
of register. Apart from the red, pink and white colours ‘ classically expected,’ there 
are only a few unsaturated suggestions of the new colours. Then the two images are 
slowly brought together, and ‘at the instant of perfect registration, the full gamut of 
color snaps into appearance’. 

21. If pictures of two different scenes are placed in the two projectors and shown simul- 
taneously, projected in red and white light respectively and covering the same part of 
the screen, the colours perceived are limited in the same way as the out-of-register 
picture. 

22. Finally, if a negative image is placed in one projector and the positive from that 
negative is placed in the other, when they are projected together in red and white 
light respectively, and carefully registered, the colours perceived are ‘only reds and 
whites, and a little pinkness ’. 


The second part of Land’s paper (1959 a) describes a new apparatus, a dual 
monochromator viewer, in which an observer looks into an eye-piece to view 
two images superimposed by means of a semi-mirror. Each image, from a 
photographic transparency, is illuminated by light from an incandescent lamp, 
which can be controlled for brightness with a resistance, and may be reflected 
either from a ‘white flag’ to give ‘white light’ or from a rotating diffraction 
grating to give monochromatic light. With this apparatus, different pairs of 
wavelengths were used to illuminate the photographic transparencies (presumably 
red- and green-separation positives as used previously), and in each case the 
colour perceptions reported were noted on a ten-colour list: red, orange, yellow, 
green, blue, purple, brown, black, white, and grey. The results are recorded 
on a diagram in which one wavelength is plotted against the other. 




For example, green (530 mμ) and blue (470 mμ) give a picture resulting in perceptions 
of ‘Everything except red, orange, and purple’, that is, at least one report each of a 
yellow, a green, a blue, a brown, a black, a white, and a grey. 

The full set of colour reports, excluding only purple, applies broadly to any pair where 
the red-separation is illuminated with yellow, orange or red light, and the green-separation 
with greenish blue, green or yellow light, unless both have yellow. 

With two near wavelengths, the picture is seen in a wash of colour, but this colour is 
‘so pale that it falls in an entirely different visual category from the color obtained on 
objects in images ’. 

The distance separating the two wavelengths before object-colours are reported varies 
along the wavelength scale, being particularly low (about 10 mμ) around 500 mμ and 
again near 590 mμ, and rising to about 50 mμ in the middle and at each end. (This is 
simply a measure of wavelength discrimination.) With wavelengths longer than about 
500 mμ for both slides, it is found that the colours appear in ‘correct hierarchical order’ 
if the red-separation has the longer wavelength of the pair, but if both wavelengths are well 
below 500 mμ the red-separation must have the shorter wavelength, or else the colours 
will be reversed, so that reddish objects appear greenish and vice versa. 

Various features about the region from 570 to 590 mμ are noted: Firstly the high 
discrimination between wavelengths, as already remarked. Secondly, that when mono- 
chromatic light is used for one transparency and white light (i.e. light from the dimmed 
incandescent lamp reflected from a white flag) is used for the other, the colours will reverse 
as the wavelength of the monochromatic light passes across this region. When the ‘ white 
light’ is used with monochromatic light near 588 mμ, then ‘there will be essentially no 
color in the image’. (This strongly suggests that the white light has an unusually low 
colour-temperature, about 1800°K, and is more or less interchangeable with yellow light; 
see also Land’s experiment 16.) 
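
The inference about colour temperature can be made plausible with a rough black-body calculation. The sketch below is only illustrative: it treats the dimmed lamp as an ideal Planck radiator, and the two sample wavelengths (450 and 650 mμ) and the comparison temperature of 2850°K for an undimmed tungsten lamp are assumed values chosen for the example, not figures from Land's paper.

```python
import math

H = 6.62607015e-34  # Planck constant, J s
C = 2.99792458e8    # speed of light, m/s
K = 1.380649e-23    # Boltzmann constant, J/K


def planck_radiance(wavelength_mmu, temperature_k):
    """Black-body spectral radiance at a wavelength given in millimicrons."""
    lam = wavelength_mmu * 1e-9  # 1 millimicron = 1e-9 m
    x = H * C / (lam * K * temperature_k)
    return (2.0 * H * C ** 2 / lam ** 5) / math.expm1(x)


def blue_red_ratio(temperature_k, blue=450.0, red=650.0):
    """Radiance at an assumed 'blue' wavelength relative to an assumed 'red' one."""
    return planck_radiance(blue, temperature_k) / planck_radiance(red, temperature_k)


# Assumed comparison: dimmed lamp (1800 K) against a normal tungsten lamp (2850 K).
for temperature in (1800.0, 2850.0):
    print(temperature, blue_red_ratio(temperature))
```

Under these assumptions the 1800°K source is far poorer in short wavelengths relative to long ones than the 2850°K source, which is consistent with the suggestion that it behaves more like a yellow light than a white one.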


Next Land gives some results of variations in intensity of the two lights, 
for four pairs of wavelengths. 


For instance, with the pair of wavelengths 560 mμ and 615 mμ, eight colour names 
are listed; one of these, Orange, is seen over a range of 100:1 in the ratio of energy between 
the two sources, and varies from ‘ light orange to red-orange ’. 


Finally Land describes an apparatus in which pairs of photographic trans- 
parencies two feet square, superimposed by means of a semi-mirror, are illumin- 
ated by sodium vapour lamp light and incandescent tungsten lamp light respec- 
tively. In one such apparatus the incandescent light is covered with a red filter, 
and in a second it is covered with a green filter. Five more numbered experi- 
ments describe results obtained with either or both of these devices. 


23. With sodium light illuminating the green separation positive, and red-filtered tungsten 
light illuminating the red separation positive, the picture showing books, a cup and 
saucer, a pot-plant, and so on, the range of colour sensations reported includes ‘ green 
and some blue’. 

24. With sodium light for the red separation and green light for the green separation, in a 
picture of an assortment of groceries, the result is reported to be ‘fully colored, 
covering the range from red to blue-green’. 

25. Both devices may be set up side by side and viewed either in rapid succession, or 
from a distance so that both images are seen together, and ‘ the adjacency of the two 
displays does not destroy the color in either one of them’. 

26. The room lights may be turned on, but ‘the colors of objects in the mirror viewer are 
as real as the colors of objects in the room ’. 

27. Even if the observer moves so that he may see the unmixed lights of the sources at 
the same time as the image, ‘ the colors of the image prove to be stable’. 


On this evidence Land based his theory of total images, and sought to prove 
that ‘classical laws of color mixing conceal great basic laws of color vision’. 




3. CONFUSIONS IN TERMINOLOGY 


Colour science spreads through several disciplines, such as physics, physiology 
and psychology, each with a distinct, but fairly well-defined usage, and many 
of its terms are found in common speech, where the usage is remarkably variable 
(Report of the Physical Society Colour Group, 1948). The word ‘ colour’ 
itself has at least seven different meanings in everyday life, yet the context 
usually makes plain which is meant. Land uses the word in many of these 
senses, but most often in the sense of a perceived quality in the visual field, 
representing a property of objects. The usage in physics and optics, particularly 
in colorimetry, is generally to restrict the meaning of ‘ colour’ to a measurable 
property of light. Indeed the Optical Society of America has decided upon a 
single definition of the term (OSA Committee on Colorimetry, 1953), and 
requires all authors submitting matter to their Journal to use it in this defined 
sense: 


‘Color consists of the characteristics of light other than spatial and temporal inhomogeneities; 
light being that aspect of radiant energy of which a human observer is aware 
through the visual sensations which arise from the stimulation of the retina of the eye.’ 


Obviously statements made about colour with this definition in mind should 
not be taken as referring to visual sensations or perceptions; such a misunder- 
standing could have led to conclusions about the purpose of colour mixing laws 
similar to those reached by Land. 

Land’s terminology is unorthodox, but some of his terms are carefully 
explained. Thus his ‘long record’ and ‘short record’ stand for red separation 
positive and green separation positive, respectively, which are produced from 
negatives exposed through red and green filters. The red filter passes the longer 
wavelengths of the spectrum, and the green filter passes the medium band of 
wavelengths. The shorter wavelengths are passed by a blue filter, but Land 
does not generally avail himself of this aid to better colour reproduction, so the 
term ‘short’ is applied to what might otherwise be called the ‘medium record’. 

Land explains that by ‘ achromatic’ he means with little or no colour, or 
with the colour restricted to a monotone or to a reduced gamut of colours, or 
even not fully coloured in a naturalistic manner. Achromatic usually indicates 
a complete lack of hue—no trace of any redness, greenness, and so on—but 
Land uses it in the sense of not indicating clearly the colours belonging to objects. 
Thus he reports ‘essentially no color in the image’ (as an ‘ Achromatic Situa- 
tion’) when none of the objects are perceived to be correctly coloured, even though 
the whole scene is bathed in colour. 

On the other hand some of his terms are not so clearly explained, and it 
only becomes apparent from the context that his usage is unorthodox. It is 
probable that this has contributed to the misunderstandings about his work that 
led to some of the excesses in early popular reports. For instance, he uses phrases 
like ‘ full gamut of color,’ or ‘ fully colored’ even when he admits himself that 
there are colours missing; “...the gamut is complete: ‘everything’ (except 
purple) is in the image”. The gamut he refers to is the list of ten colour 
names, and it is ‘full’ or ‘complete’ if there is at least one of each (except 
purple) in the image. This may still be a very restricted range of colours, in 
which, for example, the only blues present will be so dark that they are only 
just distinguishable from black; but it is hardly surprising that the phrase ‘ full 




gamut’ might be understood to mean that Land’s pictures reproduced all 
possible colours in a scene. If this were so, what use is the extra third 
record? 

Again, Land has an unusual way of employing the term ‘ stimulus ’, which 
seems to mean for him one or other of the two projections, or the transparencies 
being projected, or the component of the synthesized image projected from one 
or the other lantern. He writes as though each projected image retained its 
identity in the final picture on the screen, so that whatever kinds of light he is 
using for projection, whether monochromatic, broad band, or white, and how- 
ever complicated the details of the picture, there still remain two distinct, but 
only two, ‘stimuli’. It is hard to avoid the conclusion that Land’s theory of 
total images depends very largely on the assumption that there are only two 
‘stimuli’ in the usual sense, whereas in fact the visual stimulus is a complex of 
innumerable small patches of light of variable composition. 

A final example: Land uses the word ‘independent’ to mean not fully 
dependent in a one-to-one relationship, rather as one might claim that the 
volume of a given amount of gas were independent of pressure, had one failed 
to take temperature into account. Thus Land claims that his results prove 
that colour is ‘largely independent’ of wavelength, although his own data 
indicate that the wavelength plays a very considerable part in determining what 
colours will be perceived in the scene, and his own conclusion (Land 1959 b) is 
that “color in the natural image depends on the random interplay of longer and 
shorter wavelengths over the total visual field”. 

Allowing for the difficulty of developing terms to describe newly evolved 
concepts in an unfamiliar field, Land’s papers (1959 a, b) do make clear what he 
has actually done, and indicate what concepts he has formed about the nature 
of colour vision. 

Since Land has stressed his view that the ‘ classical theory ’ of colour cannot 
explain or predict his results, and that conclusions drawn from his experiments 
are “diametrically opposed to the main line of development of color theory, 
which flows from Newton’s experiments,” it is necessary to outline the concepts 
elaborated during the development of colour science, and then to apply them 
to Land’s work. 


4. COLOUR MEASUREMENT AND COLOUR PERCEPTION 


If colour is taken purely as a sensation, something entirely subjective, it 
cannot be measured. Even if it is considered to be an objective property of 
bodies, it comes within the category of ‘ secondary qualities ’ which cannot be 
measured directly. Until the time of Newton there was no method of measuring 
colour, although there was already an extensive practical knowledge of colorants 
and colouring; indeed painters especially had observed how the colours of 
objects were influenced by different lightings, by the atmosphere, by contrast 
with neighbouring colours, and so on. 

Newton discovered a purely geometrical entity which correlated exactly 
with colour under certain conditions, so that one varied with the other, providing 
an objective basis for measurement; in his own words: ‘To the same degree of 
Refrangibility ever belongs the same colour, and to the same colour ever belongs 
the same degree of Refrangibility.’ (Newton 1671/72). The necessary 




conditions are more exclusive than Newton thought; the colours must be mono- 
chromatic, not mixed, viewed by an eye in a normal state, with standard sur- 
roundings, and the degree of refrangibility must be measured with respect to a 
given pair of media, for instance, air and water. Later the angle of refrangibility 
was replaced by another geometrical entity, a measure of length varying from 
about four to seven hundred-thousandths of a centimeter, and identified in the 
undulatory theory of light with the wavelength in air. This yielded a better 
correlation with colour since allowance could be made for differences in optical 
density, and it could be applied to colours produced by interference, diffraction 
or other means. 

Although belief in a one-to-one relationship between wavelength and colour 
is still widespread, based no doubt on Newton’s conclusion, a little consideration 
will show that it is wrong on both counts. 

Firstly, is it true that for each colour there is ever the same wavelength? 
To begin with, certain colours such as purple, magenta, crimson, maroon, brown, 
black, grey and white are not found in the spectrum at all, and consequently 
can have no wavelength. Further, the simple linear scale of wavelength does 
not express the fact that the violet colour at one extreme end is more like the red 
at the other extreme end than are the intervening blue and green colours. 
(This fact accounts for Land’s ‘ short wave reversal’.) Thus the simple wave- 
length scale is quite inadequate to account for colours outside the limited con- 
ditions of the spectrum, and has shortcomings even within it. The solution 
was already indicated by Newton in his colour circle: the monochromatic colours 
must be mixed together to give other colours. But now we pass from paucity to 
excess. Colours can vary in three ways independently—in hue, in saturation, 
and in luminosity—giving a triple infinitude of different colours. The wave- 
length scale may be subdivided indefinitely to give a single infinitude of wave- 
lengths, and if these are to be mixed together, each may be present to a greater 
or lesser extent, independent of any other, giving an infinite infinitude of possible 
combinations. Thus the number of possible wavelength combinations, or 
spectral contents of lights, is infinitely more numerous than the number of 
possible colours, so that for each colour there must be an infinite number of 
possible wavelength combinations. These are known as metameric colours; 
a well-known example is found in the Rayleigh anomaloscope, where of two 
identical yellow colours, one consists only of yellow monochromatic light and 
the other is mixed from red and green monochromatic lights, and contains no 
‘yellow wavelengths’ at all. 
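
The anomaloscope match can be sketched numerically. The following is a minimal illustration, not a model of any real eye: the two receptor curves are hypothetical Gaussians, the wavelengths 670, 545 and 589 mμ are assumed typical values for the red, green and yellow lights, and the short-wave receptor is assumed to respond negligibly in this part of the spectrum. With those assumptions, the amounts of red and green light that excite both receptors exactly as the yellow light does follow from two linear equations.

```python
import math

# Hypothetical receptor curves (peak, width) in millimicrons. These Gaussian
# shapes are illustrative assumptions, not measured sensitivities.
LONG_WAVE = (565.0, 50.0)
MEDIUM_WAVE = (535.0, 45.0)


def response(wavelength, receptor):
    """Relative response of one receptor type to unit monochromatic light."""
    peak, width = receptor
    return math.exp(-((wavelength - peak) ** 2) / (2.0 * width ** 2))


def excitation(wavelength):
    """Pair of (long-wave, medium-wave) responses to unit monochromatic light."""
    return (response(wavelength, LONG_WAVE), response(wavelength, MEDIUM_WAVE))


def rayleigh_match(red=670.0, green=545.0, yellow=589.0):
    """Amounts of red and green light that excite both receptors like unit yellow."""
    r, g, y = excitation(red), excitation(green), excitation(yellow)
    # Solve a*r + b*g = y for (a, b) by Cramer's rule.
    det = r[0] * g[1] - g[0] * r[1]
    red_amount = (y[0] * g[1] - g[0] * y[1]) / det
    green_amount = (r[0] * y[1] - y[0] * r[1]) / det
    return red_amount, green_amount


red_amount, green_amount = rayleigh_match()
print(f"red: {red_amount:.3f}, green: {green_amount:.3f}")
```

Both amounts come out positive, so the mixture is physically realizable: two lights of quite different spectral content produce the same receptor excitations, which is what is meant by a metameric pair.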

Secondly, is it true that for each wavelength, or combination of wavelengths, 
there is ever the same colour? If the first question contained the essence of the 
problem of colour measurement—how to find an objective physical entity to 
correlate with a given colour—this question contains the essence of the problem 
of colour perception, namely, how to predict the experience of the observer 
when confronted with a given physical stimulus of visible radiation. Here 
again, there is a fairly reliable correlation between a colour specification and the 
colour perception relating to it if the eye is kept in a normal state, and the colour 
seen in isolation, for instance as a small light area in a dark surround. As 
soon as different states of adaptation of the eye are taken into account, or if the 
colour patch is seen in a wider context, there may be very large departures 
from the simple one-to-one correlation. 




An example of the first type of departure is found in the negative after-image: if the 
eye is exposed for half a minute to yellow monochromatic light and then to red, the colour 
perceived will be magenta and not a ‘ proper ’ spectrum red, and if the eye is turned again 
to the ‘yellow’ light after a minute looking at the red, the colour perceived is 
green. 

An example of the second type of departure is found where a small self-luminous object 
such as a pocket slide viewer screen is seen first in an illumination of lower colour temperature 
and then in one of higher colour temperature. In the first case, for example dim shaded 
tungsten light, the viewer will be perceived as a cool bluish white in rather yellowish 
surroundings; in the second case, for instance daylight, the viewer will appear distinctly 
yellow, although it is in fact the surroundings that have changed, and the light reaching 
the eye from the viewer remains physically exactly the same as it was before. 


For the purpose of a colour measurement or specification, it is obviously an 
advantage if the specification can remain the same if the light itself remains 
unaltered. For this purpose all contrast effects, extremes of brightness or dim- 
ness, and other variations that upset the normal state of the eye, must be elimina- 
ted just as in finding a measurement of length all effects of parallax and perspec- 
tive must be eliminated. What is then measured is a property of light, as defined 
by the OSA Committee on Colorimetry, and can be treated as a specification 
of what the colour is, but by itself gives no information as to what the colour 
looks like. 

Another requirement for colorimetry is that all colours that look alike under 
the specified conditions should have the same specification, even though their 
spectral content may differ. In order that the number of wavelength combina- 
tions may equal the number of colours, the number of wavelengths to be mixed 
together is reduced from infinite to three, each of which may be varied in 
intensity independently of the others. Thus any colour can be matched with, 
and represented by, a metameric colour mixed from three suitable monochromatic 
lights, each specified by its wavelength. The suitability requires only that no 
one of the three can be matched by a mixture of the other two; negative values 
may be necessary, and these are represented as in algebra by adding the amount 
to the other side, that is, some of one mixture primary may have to be added to 
the sample being matched. The observer is never called upon to express the 
nature of his colour experience, to name what he sees, but only to determine 
that there is no visible difference between two adjacent, or alternating, patches 
of colour. The two patches are then metameric and the sample can be represented 
by the mixture of three primaries, and specified within this system of 
reference by three numbers, representing the intensities of these three lights, or 
by any agreed algebraic transformation of these quantities. 
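
The matching procedure, including the negative amounts, can be illustrated by a short calculation. This is a sketch under stated assumptions: the three receptor curves are hypothetical Gaussians standing in for the real sensitivities, and the primaries are taken at 700, 546.1 and 435.8 mμ. The amounts of the three primaries are found by requiring that the mixture excite each receptor type exactly as the sample does.

```python
import math

# Hypothetical receptor curves (peak, width) in millimicrons; illustrative only.
RECEPTORS = [(565.0, 50.0), (535.0, 45.0), (445.0, 30.0)]
PRIMARIES = [700.0, 546.1, 435.8]  # red, green, blue mixture primaries


def responses(wavelength):
    """Responses of the three receptor types to unit monochromatic light."""
    return [math.exp(-((wavelength - p) ** 2) / (2.0 * w ** 2)) for p, w in RECEPTORS]


def det3(m):
    """Determinant of a 3 x 3 matrix given as a list of rows."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def match_amounts(sample_wavelength):
    """Amounts of the three primaries that match a monochromatic sample."""
    columns = [responses(p) for p in PRIMARIES]
    # Rows are receptor types, columns are primaries.
    matrix = [[columns[j][i] for j in range(3)] for i in range(3)]
    target = responses(sample_wavelength)
    d = det3(matrix)
    amounts = []
    for j in range(3):  # Cramer's rule: replace column j by the target
        replaced = [row[:] for row in matrix]
        for i in range(3):
            replaced[i][j] = target[i]
        amounts.append(det3(replaced) / d)
    return amounts


red, green, blue = match_amounts(500.0)
print(f"red: {red:.3f}, green: {green:.3f}, blue: {blue:.3f}")
```

With these assumed curves the red amount for a 500 mμ sample is negative. In the experiment this corresponds to adding that quantity of the red primary to the sample, and matching the result with a mixture of the green and blue primaries alone.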

The idea of matching other colours with a mixture of three spectral lights 
is due to Thomas Young, the laws of addition and substitution of lights we owe 
to Grassmann (although they are implicit in Newton’s treatment), and the 
representation of the proportions of the three mixture primaries on a triangle 
originates with Maxwell. (See fig. 1.) Many other physicists have contributed 
to the refinement of the science of colorimetry, among whom Helmholtz, Koenig, 
Priest, Guild, and Wright are well known, until today we have in the 1931 
standard of the Commission Internationale de l’Éclairage numerical data expressing 
the colour mixture characteristics of an agreed standard observer, thus eliminating 
even those variations that occur between different examples of human 
colour vision. 




It should be stressed that the use of wavelengths for standardizing the three 
mixture primaries, or fundamental stimuli (700, 546.1 and 435.8 mμ in the 
CIE system), is only a matter of convenience. The method of colour measure- 
ment would work equally well if the fundamental stimuli were specified in some 
other way, perhaps as lamps with an agreed physical structure and performance 
shining through coloured glasses with agreed chemical composition and 
dimensions. Just as such lamps give colours that can be specified in terms of 
given wavelengths, so could monochromatic lights be specified in terms of these 
lamps, though it would be much more difficult to ensure that the lamps used 
conformed to the standard. The point is that the science of colorimetry has 
made it possible to specify colour, in the sense of a property of light, by using 
a three-figure formula, and without the need for lengthy tables or graphs of 
spectral composition, and that each such formula applies to all metamers what- 
ever their spectral composition might be. 


[Fig. 1: diagram of the colour solid; surviving vertex labels: Magenta, Yellow, Black.] 


Fig. 1. Colour space of three dimensions, derived from the additive mixture of three 
primary lights. If the vectors representing the red, green and blue primaries have 
lengths indicating the maximum luminance of each light, then all possible combina- 
tions will be represented by points lying within the smaller solid, a distorted cube- 
cell. Direction from the origin (Black) indicates chromaticity (bearing = hue, declina- 
tion=saturation) and distance indicates luminance (quality and quantity of light 
respectively). By projection from the origin on to any suitable plane, the two 
dimensions of chromaticity may be represented in a plane diagram, the Maxwell 
triangle; since the dimension of luminance is not represented, all neutral greys will 
plot in the same point as white, browns and olive-greens with orange and yellow, 
and so on. 
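
The projection described in the caption can be sketched as a simple normalization. In this minimal example the function name and the sample amounts of the three primaries are hypothetical; the point is only that dividing each amount by the total discards luminance, so that colours differing in luminance alone fall on the same point of the triangle.

```python
def chromaticity(red, green, blue):
    """Project amounts of the three primaries on to the plane red + green + blue = 1."""
    total = red + green + blue
    if total == 0:
        raise ValueError("black lies at the origin and has no chromaticity")
    return (red / total, green / total, blue / total)


# Hypothetical amounts of the red, green and blue primaries.
white = chromaticity(1.0, 1.0, 1.0)
grey = chromaticity(0.25, 0.25, 0.25)
orange = chromaticity(1.0, 0.5, 0.0)
brown = chromaticity(0.25, 0.125, 0.0)

print(white, grey)
print(orange, brown)
```

Grey and white share one point, and so do the assumed brown and orange, because each pair differs only in distance from the origin and not in direction.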




Some Requirements for a Theory of Vision. 

One of the first tasks of any theory of colour vision is to account for the facts 
of metamerism; an infinite manifold of objectively distinct wavelength combina- 
tions is reduced for vision to a triple manifold of colours. The simplest type 
of mechanism that satisfies this requirement is one containing three types of 
receptor, each with a different spectral sensitivity, and each responding to 
stimulation independently of the others. Such a mechanism must obviously 
be in agreement with the anatomical structure of the retina, but it is not known 
for certain whether there are three distinct types of cone cell, three distinct 
photochemical pigments in each cone, or perhaps only three distinct parts of 
each cone. What is certain is that a mechanism based on two receptor types 
only would be inadequate. 

But a mechanism which can thus reduce radiation stimuli to colour sensations 
is not sufficient to account for colour perceptions in a wider field, however well 
it may agree with perceptions confined to the viewing conditions of a colorimeter, 
unless it is supplemented by some sort of balancing mechanism which will 
relate and compare sensations temporally and spatially. The actual location 
of such a balancing mechanism is even more uncertain than that of the receptors; 
suggestions have varied from interactions affecting the sensitivity of the receptors, 
at one extreme, to purely mental, if unconscious, inference affecting the judgment 
of the observer at the other. It is possible that adjustments in the neural 
activity carrying the colour information can occur at all levels from the receptors 
up to the highest centres, where the effects of learning and previous experience 
are operative. The requirements for this adjusting mechanism can be expressed 
in a set of equations to be applied to the colorimetric specifications; perhaps the 
best example is seen in Judd’s formulae published over twenty years ago (Judd 
1940). These relate the specification of the colour patch in question to the 
average colour of the field of view, with factors allowing for relative areas, 
lightness levels, and patterns of eye-movements, and predict the colour appear- 
ance in terms of hue, saturation and lightness. Although these formulae were 
designed to cover all cases in which surface colours, or coloured objects, were 
perceived in a coloured illumination, they had not actually been applied to a 
synthetic situation in which such a setting is simulated by projected pictures until 
Judd came to appraise Land’s work (Judd 1960). Nevertheless, they give pre- 
dictions of colour perceptions in the given situations which are considerably more 
precise than Land’s rather vague colour names, as they include a quantitative 
expression of the lightness and saturation as well as indicating the hue. 

There are at least three distinct effects that must be taken into account 
before a colour specification can be converted into a predicted colour perception, 
but in each case, though the phenomena are well known, there is a great lack of 
precise data to aid the formulation; in general, only qualitative results are 
available. 

Firstly there is the effect of previous stimulation of the eye, in particular of 
the area of the retina including the point of regard. This is the effect operating 
in negative after-images, and must influence the perception whenever the gaze 
passes from one colour to another. Although Land believes that his results 
‘demand formulas which are . . . fully independent of time’ (Land 1960), the 
conditions of his experiments require the observer to look at different parts 
of the picture in succession. Therefore successive contrast cannot be entirely 




ruled out as a contributing factor, even though it may play no part in the immediate 
appearance of a picture (Land’s experiment 4). The phenomena of 
successive contrast are sometimes attributed to a mechanism of temporal induction, 
which would alter the signal emitted from a stimulated receptor-system as a result 
of previous stimulation, but there is no reason why there should be any time 
lag before such alterations began to be effective. Whatever the mechanism, 
the effect is to move the colour perception towards the colour complementary 
to that of the previous stimulation; for instance, after looking at a brilliant red 
patch, another patch that would normally appear a neutral grey will now have 
a distinct blue-green hue (Wilson and Brocklebank 1955). 

The second effect is similar to the first, and although it may be difficult to 
eliminate the first entirely, requires to be considered as a distinct effect: this is 
the simultaneous contrast with spatially adjacent patches of colour. Again the 
effect is to move the colour perception towards a colour that is complementary 
to that of the adjacent patch, so that what is normally grey will be perceived 
with a blue-green hue if it is surrounded by a brilliant red area. 

A particularly striking form of simultaneous contrast is seen in the experiment of the 
coloured shadows (Rumford 1794, Goethe 1810); where a white screen is illuminated by 
a mixture of light from red and white lamps, and an interposed object casts two partial 
shadows, the shadow lit only by white light looks blue-green. 

In Judd’s method these two effects are not considered apart, but due allow- 
ance is made for different methods of viewing, whether with fixed or roving gaze, 
in balancing the parts played by the colour patch in question and its surroundings. 
If the attempt is made to explain the colours in Land’s projected pictures by a 
mechanism of spatial induction alone, as Walls has done (Walls 1960), this requires 
only that one ascribes sufficiently large a weight to the induced colour; un- 
fortunately such a mechanism ought to produce equally varied colour perceptions 
from a situation such as is found in Land’s experiments 20 and 21, and this is 
not the case. In fact, the extent of the new colours produced in these situations 
is some measure of the contribution of simultaneous contrast to the total effect. 

These two effects, taken together, include all spatial and temporal variations 
in the field of view, but do not allow for the total effect on the observer. This 
third effect may be considered one of interpretation, or inference, or learning 
from past experience, but it is beyond question that the perception of colour 
may be influenced if the total scene is such that it presents the aspect of illumin- 
ated objects. The observer is then faced with the possibility of ascribing the 
colour wholly or partly either to the objects or to the illuminant. It is significant 
that in all Land’s experiments in which he gets results in conflict with his 
concept of classical laws of colour mixing, the observer is faced with a scene 
representing illuminated objects, and is asked to name the colours of objects 
quite specifically. 

It is not only a question of recognizing familiar objects, and knowing from 
past experience what their colours are. Though this is known to influence 
colour perception, it could not produce perceptions of colours out of nothing, so 
to speak, as in an outline drawing or black-and-white photograph; but it can 
help in doubtful cases such that a blue-green colour could be reported as green 
if it were recognized as grass or blue if it represented the sea on a map. This 
kind of recognition must play a part in producing the reports of colours seen in 
Land’s pictures—in fact one of his settings shows a dark-coloured packet clearly
labelled ‘ Reckitt’s Blue ’, and it would be very surprising if this did not influence 
the report to some extent, particularly if there were already some doubt as to 
its colour—nevertheless Land’s experiment 8 and others using objects whose 
colours could not be known beforehand are sufficient to show that memory of 
actual colours is not a necessary condition for perceiving a wider range than 
expected. 


Some Features of Object Colour Perception. 

The central factor of object colour perception was expressed by Helmholtz 
in the phrase ‘ discounting the illuminant’. By this he meant an unconscious 
process whereby objects often seem to keep the same colours even though the 
colour of the illuminant, and therefore of the light actually reflected from the 
object into the observer’s eye, has changed. The term colour constancy is also 
used to describe this property of object colours, and the term colour transforma- 
tion has been used for those cases where the light reflected from an object seen 
as such appears to differ in colour from the perception of the same light in isolation. 
The term colour conversion was proposed by Helson (1938) to include all changes 
in colour perception of the same object, seen in changed circumstances, avoiding 
the suggestion that any one of the perceptions is more normal than the others. 
Colour conversion applies where colour constancy breaks down. 

The most familiar case of colour constancy occurs with the natural changes 
in illumination from the cool blue light from a clear north sky, favoured by artists 
and colour-matchers, through the overcast greys to the warm yellow glow of 
the setting sun. Still more yellow are the various incandescent light sources 
such as the tungsten filament lamp, oil lamp, or candle. Yet with the change 
of illuminant white objects continue to look white, although the colour of the 
light they reflect has altered, and other coloured objects also keep their colours, 
even though they too reflect light of a changed colour. In this type of setting 
the actual colour of the illuminant is accepted as white, and the eye is said to be 
adapted to that illuminant; when the colour of the illuminant changes, the eye 
re-adapts and keeps pace with the change. 

Adaptation is simply the way that an organism fits its environment; it may 
describe either the state of fitting or the process of becoming fit. For vision, 
adaptation is the way in which an organism produces perceptions from a set of 
stimuli; if the illuminant is seen as uncoloured, then the adaptation is complete. 
It does not make any difference whether the process of change is fast or slow, 
but when Land rejected adaptation as a factor in his experiments he did so in the 
belief that it was always a gradual process. In fact adaptation to darkness, 
which is probably the best known example, can be very slow—‘ getting used to 
the dark ’ when coming out of a brightly lit house into a moonless night may take 
up to twenty minutes. The reverse process, adaptation to brightness, is much
quicker as it does not depend on the slow regeneration of bleached photochemical 
pigment. Chromatic adaptation over small changes in colour may also be quick 
enough to escape notice, though the perception of the illuminant colour as 
white may not happen as immediately as the perception of the white colour of 
objects. Even in cases of incomplete chromatic adaptation, in the sense that the 
colour of the illuminant is still marked, object colour perceptions may be such 
that adaptation appears to be complete, so that white objects look white. This 
is really a different class of adaptation, because the observer is not asked to report
the colour of the light reflected from the object, but the colour of the object 
itself; in other words, the concept of colour is not the same. 

The nature of the question facing the observer may be compared with the 
case of perceiving shapes. A number of circles and ellipses viewed from a 
variety of angles will present a variety of projected shapes in perspective. A 
competent draughtsman could reproduce the shapes as seen, but would still be 
able to pick out the circles from the ellipses by allowing for the tilt of the various 
planes. Such a ‘ discounting of the tilt’ might be due to recognizing that the 
elliptical shapes seen belonged to circular objects—plates, wheels, or coins— 
or might depend on seeing the wider context, perhaps a table-top, in which the 
shapes lie. In this connexion one could hardly claim that ‘ classical laws of 
Euclidean geometry conceal great basic laws of perspective.’ The process of 
perception, here as with the coloured scenes, consists in matching percepts with 
concepts so as to make sense: a percept ‘ elliptical shape’ matched to a concept 
‘circle ’ could make sense as ‘a circle tilted at an angle’, and in much the same 
way a percept ‘ pink patch’ matched to a concept ‘ white object ’ could make 
sense as ‘a white object in a pink light’. Similarly, just as there will be ellipses 
that turn into other ellipses and even a real ellipse that may project in perspective 
as a circular shape, there will be changes from one colour to another so that a 
percept of a magenta-coloured patch may make best sense as a blue object
in a red light, or perhaps a blue reflection on a red surface, while a neutral grey 
patch may be perceived as a green object in a (discounted) pink light, if this is 
more consistent with the surroundings. 

It must be stressed that by far the greater part of our colour experience is 
in the realm of object colour perception. Signal lights at night form about the 
only common case of aperture colour perception, comparable to the type of 
viewing employed in a colorimeter, in which discrimination in the colour of the 
light is at all important. Of course the stars have always been with us, but it is 
fair to presume that the language of colour names has grown entirely from object 
colours, as indeed their etymology bears out. Most people, faced with a scene 
and asked to name the colours in it, will unhesitatingly start naming the colours 
of objects. 

Discounting the illuminant colour, whether by physiological or psychological 
adaptation, can be fairly simply allowed for in computing colour perceptions 
from colorimetric data, by taking the colour point measured for the illuminant, 
for instance as reflected from white objects, and making it the central reference 
representing neutral colour perceptions, with all vectors representing other 
colours shifted accordingly, so that directions on the colour triangle (representing 
hue) and distances (representing saturation) are taken from a new point. This 
is only a first approximation, and does not take into account the brightness level. 
The average brightness over the scene, including both objects and background, 
was termed the adaptation level by Helson, who formulated a principle for object 
colour perceptions in chromatic illumination, which states that individual 
patches lighter than the adaptation level tend to take on the hue of the illuminant, 
and patches darker than the adaptation level tend to take on the after-image 
hue, complementary to the hue of the illuminant (Helson 1938). Judd has 
incorporated this principle into his formulae (Judd 1940). It is a principle 
based purely on observation, and does not depend on any particular theory of 
colour vision, though it must be taken into account in any attempt to assess
object colour perceptions. Thus in a Land-type picture projected with 
orange-red and yellowish tungsten light, the total illuminant mixed from 
these two appears to induce yellowness into light-coloured objects and blueness 
into dark objects, and in this respect Judd’s calculations agree very well with 
Land’s observations. 
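
The change of reference point and the Helson principle described above can be sketched in a few lines of Python. This is an illustrative sketch only and is not Judd's set of formulae: the function names, the use of plain (x, y) co-ordinates on the colour triangle, and the three-way classification are assumptions made here, and the dependence on brightness level that Judd incorporated is left out.

```python
import math


def hue_and_saturation(sample_xy, illuminant_xy):
    """Direction (degrees) and distance of a colour point, measured from the
    illuminant point rather than from a fixed white point (hypothetical helper)."""
    dx = sample_xy[0] - illuminant_xy[0]
    dy = sample_xy[1] - illuminant_xy[1]
    saturation = math.hypot(dx, dy)                        # distance stands for saturation
    hue_angle = math.degrees(math.atan2(dy, dx)) % 360.0   # direction stands for hue
    return hue_angle, saturation


def helson_tendency(patch_lightness, adaptation_level):
    """Qualitative Helson rule for a non-selective patch in chromatic light."""
    if patch_lightness > adaptation_level:
        return "illuminant hue"
    if patch_lightness < adaptation_level:
        return "complementary (after-image) hue"
    return "neutral"
```

In this sketch a patch whose colour point coincides with that of the illuminant is given zero saturation, which corresponds to a white object continuing to look white in a coloured light.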


Anti-tritanopic Conversion.


There is one other effect which seems to be relevant to Land’s results, that 
was known as a peculiarity in colour vision, though nobody had pointed out all 
its implications for object colour perception until Land’s experiments came to 
be repeated; this is the effect of small-field, or foveal, tritanopia (see review by 
Hurvich and Jameson 1960). Tritanopia is a type of dichromatic colour-defective
vision in which neither blueness nor yellowness can be distinguished from 
greyness at the same lightness level, leaving only red and green as distinct hues. 
This defect is found even in normal trichromats, with full colour vision, if 
the sample is very much reduced in size, to the order of a few minutes of arc 
subtense. But if faced with an extended yellow surface, a normal observer is 
not aware of a small grey patch at the point of regard, even though he can match 
a small piece of the same yellow material with another that is grey; in other words, 
though the signal from the fovea may be ambiguous and identical with the 
normal signal for a grey colour, it is also consistent with it being a yellow colour, 
so the perception is more likely to be of yellow if this fits in better with the 
surroundings. Given such a very small patch of colour as part of a complete 
scene, the process of perception may well add yellowness to a light colour or 
blueness to a dark one if this makes better sense of the scene, even though 
there was no yellowness or blueness in the original. 

Land thought that angular subtense was not a contributory factor to his 
results, because his experiment 7 showed that red and green were not affected 
by size. But in his experiment 6, where yellow is distinguished from pink and 
green-blue from deep-blue, it is noticeable that the yellow and deep-blue objects 
are much smaller than the pink or green-blue areas. With similar pictures, it 
is found that the yellow objects change to pink, and blue to green, when viewed 
from near to, so that the areas subtended at the eye are increased (Wilson 
and Brocklebank 1960). Karp, in repeating Land’s experiments, found that 
whereas red and blue-green colours were consistently reported, yellow and blue 
were markedly subtense-dependent (Karp 1959). 

If a set of coloured patches is painted with colours that are red, strong pink, 
pale pink, blue-green, dark blue-green, and dark brown, respectively, mixed 
only from an orange-red pigment (vermilion) and a blue-green pigment (viridian) 
with white and black where necessary, all mounted on a white or pale grey card 
and viewed from a distance so that the individual patches subtend less than one 
degree at the eye, many observers report perceptions of red, orange, yellow, 
green, blue, and violet or purple (Wilson and Brocklebank 1960, Karp 1960). 
This result, if staged in yellowish light, could be accounted for by the Helson 
principle, and in fact similar results are obtained even at much larger angular 
subtense in yellow illumination; but where it occurs in an uncoloured illumina- 
tion it must be due to a compensation for small-field tritanopia, a process 
which may be termed anti-tritanopic conversion. 
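
The condition that each patch should subtend less than one degree at the eye can be checked with elementary geometry. The sketch below is an illustration only: the function names are hypothetical, and the patch is assumed to be flat and viewed face-on.

```python
import math


def angular_subtense_deg(patch_width, viewing_distance):
    """Angle subtended at the eye by a patch seen face-on, in degrees.
    Both arguments must be in the same unit of length."""
    return math.degrees(2.0 * math.atan(patch_width / (2.0 * viewing_distance)))


def minimum_distance(patch_width, max_subtense_deg=1.0):
    """Smallest viewing distance at which the patch subtends no more than
    max_subtense_deg degrees (same unit of length as patch_width)."""
    half_angle = math.radians(max_subtense_deg) / 2.0
    return patch_width / (2.0 * math.tan(half_angle))
```

For a patch 2 cm wide (an arbitrary example size), `minimum_distance(0.02)` gives a viewing distance of roughly 1.15 m; nearer than this the patch subtends more than one degree.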




Colour Rendering and Reproduction. 


When considering a scene composed of illuminated objects, the actual colour 
of the illumination is not the only factor that can affect the perceived colours 
of the objects. Illuminants that look alike in colour can still differ in their
colour rendering properties, and where an illuminant becomes chromatic,
although adaptation may compensate to some extent, it cannot restore distinc- 
tions that have been lost. 

In a scene lit with yellow light, a piece of white paper reflecting the yellow 
light may well be perceived as white, but a piece of yellow paper, which reflects 
the yellow light equally well, will look exactly the same, and so will also be 
perceived as white; the distinction between white and yellow of equal brightness 
is lost. Similarly, the distinction between dark blue and black is lost, and if 
the Helson effect restores the perception of blueness, it will do so equally for 
black as for blue objects. On the other hand, in a yellow light in which only 
the short wave radiations are missing, as in the light from a candle or through a 
yellow filter, red and green objects will still be quite distinct, although there is 
loss of distinction between yellow-red and blue-red, yellow-green and blue-green, 
orange and mauve, brown and purple, and so on. With such an illuminant 
the total colour range is reduced from a three-variable set to a two-variable set. 
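
The loss of distinctions under a yellow light lacking the short-wave radiations can be illustrated with a crude three-band model, in which a surface and an illuminant are each described by three numbers for the long, middle and short wavelengths. The model, the names and all numerical values below are assumptions chosen for illustration, not measurements.

```python
def reflected_light(reflectance, illuminant):
    """Band-by-band product of surface reflectance and illuminant power.
    Bands are ordered (long, middle, short) wavelengths."""
    return tuple(r * e for r, e in zip(reflectance, illuminant))


# Illustrative values only.
WHITE_LIGHT = (1.0, 1.0, 1.0)
YELLOW_LIGHT = (1.0, 1.0, 0.0)   # short-wave band missing

SURFACES = {
    "white paper": (0.9, 0.9, 0.9),
    "yellow paper": (0.9, 0.9, 0.1),
    "red object": (0.8, 0.1, 0.1),
    "green object": (0.1, 0.7, 0.2),
}

for name, reflectance in SURFACES.items():
    print(name, reflected_light(reflectance, YELLOW_LIGHT))
```

In the yellow light the white and yellow papers return identical light, while the red and green objects remain distinct: only two of the three variables survive.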

Another yellow illuminant might have just the same colour to look at, but 
produces very different results in the colours of objects. The sodium vapour 
lamp reduces the colour range to a one-variable set so that only differences in 
lightness remain—all differences in hue, even between redness and greenness, 
are lost. It is still possible to distinguish between darkness due to shade or 
shadow and darkness due to the colour of objects, although quantitatively the 
absence of light is in each case the same. Thus an observer aware of the wider
context of the setting can pick out with fair certainty very light (or white) objects 
and very dark (or black) objects, but in all middle values the actual colour remains 
completely ambiguous and indistinguishable from grey. 

A similar state occurs with a ruby-red dark-room lamp illuminating a 
collection of objects: bright red objects might be taken for white, adaptation 
to the light might make it look orange or yellow rather than red, with deep blue 
perceived in the shadows and dark objects, but still there is only a one-variable 
set of colour differences. Precisely the same appearance is produced by taking 
a red-separation photograph of the scene, and projecting the positive with red 
light; this is an accurate reproduction of the scene as illuminated with a red 
illuminant. Now if a green-separation positive is added in correct register, 
projected in green light, the double projection reproduces the scene as illuminated 
with a broad-band yellow light, and there is a two-variable set of colours. A 
third projection in blue light of a blue-separation added to the other two, re- 
produces the scene as illuminated with a full white illuminant, as first demonstra- 
ted by Maxwell just one hundred years ago (fig. 2). 
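
The additive build-up from one to three separations can be sketched as follows. This is a simplified illustration under stated assumptions: each separation positive is taken to transmit its primary in direct proportion to the reflectance recorded in that band, the primaries are idealized, and the two sample surfaces are given invented reflectances chosen to agree in the red and green bands.

```python
BAND_INDEX = {"red": 0, "green": 1, "blue": 2}

# Idealized primary lights, as (red, green, blue) power.
PRIMARIES = {
    "red": (1.0, 0.0, 0.0),
    "green": (0.0, 1.0, 0.0),
    "blue": (0.0, 0.0, 1.0),
}


def separations_of(reflectance, bands):
    """Transmittance of each separation positive for one patch of the scene."""
    return {band: reflectance[BAND_INDEX[band]] for band in bands}


def project(separations):
    """Additive projection: sum the primary lights passed by each positive."""
    screen = [0.0, 0.0, 0.0]
    for band, transmittance in separations.items():
        for i, power in enumerate(PRIMARIES[band]):
            screen[i] += transmittance * power
    return tuple(screen)


# Invented reflectances, equal in the red and green bands.
PINK = (0.9, 0.7, 0.7)
YELLOW = (0.9, 0.7, 0.2)

two_colour = ["red", "green"]
three_colour = ["red", "green", "blue"]
print(project(separations_of(PINK, two_colour)) == project(separations_of(YELLOW, two_colour)))
print(project(separations_of(PINK, three_colour)) == project(separations_of(YELLOW, three_colour)))
```

With red and green projections only, the two patches give the same light on the screen, as they would in a broad-band yellow illuminant; adding the blue projection separates them.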

It is clear that although three separations, one in each primary colour, are 
necessary to reproduce the full tri-variant range of colours, two will suffice to 
produce a natural scene in a yellow light. (The same is true of a cyan or magenta 
light, but neither of these is common enough to be accepted as natural.) Most 
of the pairs of projecting colours used by Land in his experiments 12-16, and 
all the more successful of the monochromatic pairs, added up to make a total 
illuminant that was yellow or yellowish. Although a two-variable range of
colours can never be as complete as a three-variable set, it may still contain all
hues, since hue need no longer be independent of saturation and lightness; it
may even contain all colours, in the sense of at least one example from each main
colour-name block. But confusions or ambiguities can only be avoided, when
reproducing a scene with a two-colour process, if the subject matter itself is limited
in the range of colours displayed; such selection of objects is also apparent in
Land’s pictures.

Fig. 2. Three-colour separation positives of a still-life group. (a) Red separation, taken
with a red filter, recording longer-wave radiation reflected from the scene; (b) green
separation recording middle-wave radiation; (c) blue separation recording shorter-wave
radiation. The objects include, from left to right, a white newspaper, blue
and amber fruit glasses, light green glass ashtray, red and purple matchbox, blue
cigarette packet, deep orange napkin (flat), turquoise napkin in silver ring,
blue-and-white cup and saucer, dark green stoneware jug, mahogany tray with brass handles,
green fruit glass, green-, blue- and red-wrapped biscuits, blue-and-white soup dish,
pink cream jug, gold-embossed red cigarette box, yellow butter on blue-and-white
saucer, yellow knife handle, deep blue napkin in orange-yellow ring, pottery mustard
dish with turquoise rabbit, red, white and blue wheels and yellow inside; white
linen table cloth and dark green curtains.

Note that if the blue separation is not used, there is loss of distinction between some
of the colours, e.g. the pink of the small jug and the yellow of the butter, or the blue
and the green of the fruit glasses.


[Fig. 3, diagrams (a) and (b): the surviving labels are White, Red, Cyan and Black.]


Fig. 3. (a) Colour realm of two dimensions, derived from the additive mixture of two 
primary lights. This corresponds to a vertical section through the colour solid 
in fig. 1. Only two hues are present, one complementary to the other. Each such 
pair of primaries gives a different section through colour space, but in each case the 
resultant represents the maximum available light and is interpreted as the illuminant, 
as reflected from white objects. This remains true even if the plane is a tilted, rather
than a vertical, section, with a coloured resultant. (b) Instead of plotting the amount 
of light available, it is possible to plot degrees of darkening, with the maximum available 
light (whatever it is) as the origin, and the density in the slide projected with the 
‘warm’ component of the light plotted against the density in the slide projected
with the ‘cool’ component. (The same can be done in three dimensions for a
three-component system.) Plotting the darkness in this way gives a fairly good 
correlation with object colours, which result from darkening the illuminant by 
absorption, and remains more or less independent of variations in the primary lights; 
it is equivalent to Land’s ‘ new co-ordinate system ’. 
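
The darkening co-ordinates described in the caption to fig. 3 (b) can be sketched numerically, taking the usual definition of optical density as the logarithm to base ten of the reciprocal of the transmittance. The function names and the tolerance used to call a patch neutral are assumptions made for illustration; they are not taken from Land or from the figure.

```python
import math


def density(transmittance):
    """Optical density D = log10(1 / T) for a transmittance 0 < T <= 1."""
    if not 0.0 < transmittance <= 1.0:
        raise ValueError("transmittance must lie in (0, 1]")
    return math.log10(1.0 / transmittance)


def darkening_coordinates(warm_transmittance, cool_transmittance):
    """Point in the (warm density, cool density) plane. The origin is the
    maximum available light, i.e. a clear patch in both slides."""
    return density(warm_transmittance), density(cool_transmittance)


def warm_or_cool(warm_transmittance, cool_transmittance, tolerance=0.05):
    """Rough reading of the point: the less darkened component dominates.
    The tolerance is an arbitrary illustrative value."""
    d_warm, d_cool = darkening_coordinates(warm_transmittance, cool_transmittance)
    if abs(d_warm - d_cool) <= tolerance:
        return "neutral"
    return "warm" if d_warm < d_cool else "cool"
```

A patch that is clear in the ‘warm’ slide but dense in the ‘cool’ slide lies near the cool-density axis and is read as a warm object colour; equal densities in both slides lie on the diagonal of neutral greys.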


A two-colour projection system can give a true white total illuminant by 
choosing the primary colours to be additively complementary, for instance, red 
and cyan (fig. 3). In this case the hues of the projecting colours may be highly 
saturated, but the hues in quadrature to these, purple-blue and greenish yellow, 
will be non-existent; the Helson effect cannot add colours to objects where the 
apparent illuminant is achromatic (in the strict sense), and yellow or blue 
perceptions will have to depend on anti-tritanopic conversion in small patches 
or on the aid of colour memory. Apart from neutral greys, black and white, 
the colours of objects will have one of two hues, either ‘warm’ or ‘cool’, in
various degrees of lightness and saturation. Such an appearance does not 
conform to any naturally illuminated situation, as seen by a normal observer, 
and though all the information in the picture is quite accurate as far as it goes, 
the effect is quite unsatisfactory because the promise of colour is only half 
fulfilled. 

A simple black-and-white photograph gives even less information about the 
colours of objects, as only a single variable is involved, but it is still quite accept- 
able, perhaps because no hopes are raised for colour perception. In fact even 
a black-and-white photograph gives some information about the colours of objects, 
since light and shade due to illumination and shadow is distinguished from 
lightness and darkness due to the colouring of surfaces; there is even a good 
chance of correctly naming the actual colours of black or white objects. Thus 
the amount of information in such a picture is the same as if the scene were 
illuminated by a monochromatic light, although the actual appearance is unlike 
that of any natural (or possible) illuminant. 

It is worth emphasizing the amount of colour-information in a black-and- 
white photograph in order to put in proportion the claims for two-colour photo- 
graphs. It is quite misleading to suggest that practically all the colour- 
information suddenly appears with the addition of the second projection, and 
that a third then adds little or nothing. A monochrome picture indicates the 
lightness of objects; it may give no indication of hue, in this sense being colour- 
less, but if the illuminant is chromatic there may well be perceptions of two 
different hues besides achromaticity, as Helson has shown. This does not mean
that the hues of the objects are being reproduced; they still remain quite am- 
biguous, since the perceived hue is here a function of lightness. In a typical 
two-colour picture, the warm-cool aspect of hue is reproduced as well as the 
lightness; thus, besides black and white objects, there is a good chance that red, 
brown and green objects will be correctly named. In spite of the long list of 
colours perceived, possibly including all hues, there remains a large degree of 
ambiguity in the reproduction of all other colours. Only in a three-colour 
picture is all ambiguity eliminated, with hue, saturation, and lightness as in- 
dependent factors. Thus with the addition of the third projection the number 
of unambiguous hues leaps from two to as many as one chooses to discriminate— 
estimates vary from twenty to two hundred, according to the criteria adopted. 


The Totality of Perception.

Perceptions of the world are mediated by the sense organs, but these do not
work like passive recording instruments. Simple response to a localized
stimulus may be the first stage, but in the course of the development of a con- 
scious sensation there is a continual balancing of values and contrast of polarities. 
We have come to accept that the objective reality behind colour is the radiation 
complex, measured in part with wavelengths. Isolated colour sensations may 
be specified as a balance of wavelengths, and colour perceptions within a wider 
field may be computed from a balance of all the colours present. Thus there
appears to be a direct link from the physical radiations to the subjective percep- 
tion. But it is the final perception which is the immediate reality for conscious- 
ness, whether it correlates closely with a physical entity or not, and in fact this 
final perception is also influenced by the mental acts of recognition,
interpretation, and so on.




Land’s experiments mostly present a situation that can be interpreted as a 
number of coloured objects in a coloured light. Each of the many variations 
introduced—whether by changing the intensity of the two beams (exp. 
10), the density of the slide with a grey background† (exp. 11), or the colours
of the filters (exp. 12-16)—brings about a new balance or adaptation, and is 
interpreted for the most part as a change in the illumination rather than in the 
objects. In some experiments, for example 17 and 18, the colour is such that 
it is all attributed to the illuminant (and discounted), leaving the objects all 
‘achromatic ’, while in others, 19, 20, 21 and 22, the patches of colour, however 
varied, just cannot be interpreted as illuminated objects and the perceptions 
agree more closely with those associated with isolated lights. Experiments 
5, 25 and 27 show that discounting an illuminant is not automatic for the whole 
field of view but is limited to the particular area in which it is recognized as being 
effective. 

Thus perceptions depend on what the mind can bring to meet what the 
eye sees, on what concept can fit the percept, on what sense can be read from 
the appearances. 


5. CONCLUSION 

The presentation of Land’s series of experiments made such a stir largely 
because it appeared to demand the overthrow of a venerable theory of colour 
vision. Now it is clear that Land’s assumptions about ‘ classical colour theory ’, 
particularly the relationship between colour measurement and colour perception, 
were erroneous, however widely they may have been entertained by the public. 

A number of scientists have investigated Land’s work and published their 
findings. Walls has shown how the concept of induction may account for the 
colours reported from the projected pictures; he has also pointed out many of 
the fallacies in Land’s argument (Walls 1960). Hurvich and Jameson, too, 
regard the demonstrations as “neither more nor less than stunning illustrations
of simultaneous induction or contrast mechanisms”; they add: “Land’s current
rediscovery and dramatic exploitation of color contrast effects (even though he
does not recognize their nature, history, and place in color theory) do serve to
underscore the need for their inclusion in any analysis of changing color
perceptions under different viewing conditions” (Hurvich and Jameson 1960).

Judd, in his very careful appraisal (Judd 1960), gives a valuable survey of 
previous work in this and in related fields, including an example of the application 
of his own formulae (Judd 1940) to a two-colour additive projection, and 
concludes that no new theory is required to predict Land’s results; while pointing 
out just where Land is wrong, Judd also indicates what he has newly learnt 
from Land’s work in the way of conclusions that were implicit in earlier state- 
ments but are now made explicit. 

Karp found orthodox concepts adequate to account for Land’s results, and 
devised simple experiments that would produce perceptions of a similar range of 
colours, but without using photographs of natural scenes (Karp 1959); he was 
also stimulated to make explicit the concept of anti-tritanopic conversion (Karp 
1960). Wilson and Brocklebank (1960) extended Karp’s series of experiments 
in an attempt to isolate the conditions necessary for the appearance of the contrast
colours, such as discreteness, randomness, adapting background, object mode,
and angular subtense; they too found existing concepts sufficient for understanding
Land’s results.

†Without a grey background, a change in density may cause changes in the colours
of objects (see Wilson and Brocklebank 1960).


Wright considered Land’s two-colour projection to be “an elaborate form
of Goethe’s coloured shadows experiment” (Wright 1959). He goes on to
say that in this kind of research, “the end result is too often described in
qualitative terms only, and effective progress is unlikely to be achieved until we find
some way of actually measuring the colour appearance”. This, from one who
has done as much as any in putting colorimetry on its feet as an exact science, 
shows that the problem of assessing colour appearance is quite distinct from the 
measurement of colour as such. 


Even without his controversial theory, Land’s demonstrations remain most 
dramatic whenever they are reproduced, and will surely continue to inspire and 
stimulate workers in the field of colour perception. His hypotheses will have 
borne fruit, as Judd remarks, by suggesting new experiments. When the text- 
books come to be re-written, we can expect there to be changes, not quite 
as the journalist envisaged them, but directing more attention to the problems 
of perception, and making quite clear the distinction between colour as a
characteristic of the light, and colour as a perceived property of objects, darkening
the light, or, as Aristotle put it, providing special kinds of blackness. 


REFERENCES 


BELLO, F., 1959, An Astonishing New Theory of Color, Fortune, 59, 144 (May 1959).

GOETHE, J. W., 1810, Zur Farbenlehre, sections 62-80 (Tübingen).

HELSON, H., 1938, The Principle governing changes in Hue, Saturation, and Lightness of
Non-selective Samples in chromatic illumination, J. Exp. Psych., 23, 439.

HURVICH, Leo M., and JAMESON, Dorothea, 1960, Color Vision, Annual Review of
Psychology, 11, 99.

JUDD, D. B., 1940, Hue, Saturation and Lightness of surface colors with chromatic
illumination, J. Opt. Soc. Amer., 30, 2; 1960, Appraisal of Land’s Work on Two-Primary
Color Projections, Ibid., 50, 254.

KARP, Arthur, 1959, Colour-Image Synthesis with Two Unorthodox Primaries, Nature,
184, 710; 1960, Tritanopia and Two-colour Image Synthesis, Ibid., 188, 40.

LAND, Edwin H., 1959 a, Color Vision and the Natural Image, Proc. Nat. Acad. Sci. (US),
45, 115 & 636; 1959 b, Experiments in Color Vision, Scientific American, 200, 84
(May 1959); 1959 c, Letters, Ibid., 201, 16 (September 1959); 1960, Some
Comments on Dr. Judd’s Paper, J. Opt. Soc. Amer., 50, 268.

NEWTON, I., 1671-72, Phil. Trans., 80, Feb. 19.

Optical Society of America Committee on Colorimetry, 1953, The Science of Color (New
York: Thomas Y. Crowell Co.).

Physical Society Colour Group, 1948, Report on Colour Terminology (London: The Physical
Society).

RUMFORD, Count, 1794, Phil. Trans., 1, 107.

WALLS, Gordon L., 1960, ‘Land! Land!’, Psychological Bulletin, 57, 29.

WILSON, M. H., and BROCKLEBANK, R. W., 1955, Complementary Hues of After-Images,
J. Opt. Soc. Amer., 45, 293; 1960, Two-Colour Projection Phenomena, J. Phot.
Sci., 8, 141.

WRIGHT, W. D., 1959, Colour Vision: a Field of Unsolved Problems, The New Scientist, 6,
447.


The Authors: 

M. H. Wilson has had a life-long interest in colour photography and stage lighting; 
in 1948, with the creation of the Goethean Science Foundation, he turned to a study of 
colour perception. R. W. Brocklebank read Zoology at Cambridge and has since 1951
collaborated with Mr. Wilson.


Electroluminescence 


by D. W. G. BALLENTYNE 
The University, Hull 


1. GENERAL INTRODUCTION 


Luminescence was initially studied in natural minerals. If ultraviolet 
radiation of the correct wavelength is allowed to fall upon willemite (a natural 
mineral of zinc silicate containing traces of manganese), a green light is emitted. 
Materials which convert energy into visible radiation by such processes are 
called phosphors or luminophors. The material is called phosphorescent if the 
emission continues after the removal of the energizing radiation. In some 
materials the emission of radiation only occurs whilst the phosphor is being ex- 
cited by an external source of energy; these materials are said to be fluorescent. 
The nature of the exciting energy is indicated by means of a prefix, thus catho- 
doluminescence is the emission of light after excitation by cathode rays, photo- 
luminescence is excited by photons of ultra violet or visible radiation, and 
electroluminescence is excited by an electric field. 

Electroluminescence was discovered in 1938 by G. Destriau, who found 
that certain zinc sulphide phosphors when placed in suspension in castor oil 
between the plates of a capacitor could be excited by a low voltage, low frequency 
alternating field. During this process no heat was evolved. 


2. ZINC SULPHIDE PHOTOLUMINESCENT PHOSPHORS 


In some phosphors the process of excitation and emission occurs without 
the material becoming photoconducting, that is at no time during the 
luminescence process are free carriers released. The phosphors of interest in 
this article are, however, photoconductors. As a preliminary to understanding 
electroluminescence it is necessary to consider the general properties of photo-
luminescent phosphors of the so-called sulphide type (the host crystal is a
II-VI compound, e.g. ZnS).

Zinc sulphide can exist in two crystalline forms: wurtzite, the high temperature
modification, which has a close packed hexagonal structure, and sphalerite, a cubic
modification with a diamond-type structure. The transition temperature is 1024°c.
Pure oxygen-free zinc sulphide when heated to 1100°c in dry hydrogen sulphide 
is not luminescent at room temperature. If, however, such a powder is 
moistened with sodium chloride and refired a brilliant green phosphor is 
produced. In general, however, minute quantities of impurities known as 
activators are added to the zinc sulphide to produce the brilliant blue and green 
phosphors available commercially today. The activator is added to the zinc
sulphide along with a flux such as sodium chloride. It was believed that, at the high
temperature necessary to make a good phosphor, the flux melted and provided
a liquid phase to facilitate the rapid crystallization of zinc sulphide. The more
fundamental nature of this component was elucidated by the classical work of
Kröger. It was shown that monovalent activators such as Cu⁺ or Ag⁺ could
only enter the lattice of zinc sulphide if an equivalent quantity of a monovalent
ion (Cl⁻, Br⁻ or I⁻) called a co-activator was added at the same time. This
observation led to the formulation of the principle of charge compensation in
phosphors. The introduction of foreign ions into the zinc sulphide lattice is
only possible if electrical neutrality is maintained. Thus, if the divalent
sulphur ion is replaced by a monovalent halide ion, then either a zinc vacancy
which has captured an electron must be generated, or a monovalent activator
must be introduced into the crystal. Likewise trivalent ions such as Al³⁺,
In³⁺ and Ga³⁺ can act as co-activators in the zinc sulphide system by replacing
a divalent zinc ion whilst an adjacent zinc ion is replaced either by a vacancy or
by a monovalent activator.


Fig. 1. Energy diagram for a photoconducting phosphor. [Axes: energy against distance.]


The processes occurring during emission in a photoconducting phosphor
of the sulphide type are shown schematically in fig. 1. The electronic
energy states of the host lattice are represented as bands. The topmost
filled band A is called the valence band, whilst the lowest empty band
B, the conduction band, is separated from it by a region of forbidden energy.
The activator impurity atoms give rise to levels D in the forbidden region
close to the valence band, whilst the co-activator produces levels C below the
conduction band. The levels D are filled with electrons at ordinary temperatures
whilst the levels C are empty. When a photon of energy hν is absorbed an electron
transition occurs from A to B. The hole in the valence band is trapped by the
centre D which is therefore ionized. The free electron in the conduction band
can return from its excited state by two processes. It can return directly to D
giving rise to a photon of emitted radiation hν, or it can be trapped in the levels
T produced by impurities or lattice defects. The return of the electron from
T to the conduction band requires the absorption of thermal energy and the
number of electrons returned to the conduction band is time-dependent.
Providing retrapping does not occur, the free electron can now recombine with
the ionized centre D with the emission of radiation. The process of direct
recombination gives rise to fluorescence whilst the recombination of the trapped
electrons after release to the conduction band leads to phosphorescence.
Cathodoluminescence can be attributed to a similar process in which electrons
are excited into the conduction band from the valence band by collision with
high energy electrons supplied by the exciting cathode rays.
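
The time dependence of the release of trapped electrons described above can be
illustrated by a minimal sketch. It assumes first-order kinetics with no
retrapping, in which each trapped electron escapes at a rate p = s exp(-E/kT).
The trap depth E, the attempt frequency s and the function names are illustrative
assumptions and are not values taken from this article.

```python
import math

K_BOLTZMANN_EV = 8.617e-5  # Boltzmann constant in eV per kelvin


def escape_rate(trap_depth_ev, temperature_k, attempt_freq_hz=1.0e9):
    """Thermal escape probability per second, p = s * exp(-E / kT).

    attempt_freq_hz (s) is an assumed, illustrative value.
    """
    return attempt_freq_hz * math.exp(-trap_depth_ev / (K_BOLTZMANN_EV * temperature_k))


def phosphorescence(t_seconds, n0, trap_depth_ev, temperature_k):
    """Emission rate at time t for n0 initially trapped electrons (no retrapping)."""
    p = escape_rate(trap_depth_ev, temperature_k)
    return n0 * p * math.exp(-p * t_seconds)


if __name__ == "__main__":
    # Assumed trap depth of 0.5 eV; deeper traps or lower temperatures give
    # a longer after-glow.
    for temp in (250.0, 300.0, 350.0):
        p = escape_rate(0.5, temp)
        print(f"T = {temp:5.1f} K  mean trapping time = {1.0 / p:.3g} s")
```

On this picture the after-glow decays exponentially at a fixed temperature, and
warming the phosphor shortens the mean time an electron spends in a trap.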

The processes occurring in electroluminescence are complex and will now be 
considered in detail. 


3. ELECTROLUMINESCENCE 


The effect of the electric fields on the after-glow of photoluminescent 
phosphors was first described in 1920. It was observed that when a large 
electric field was applied to a phosphor previously excited by ultraviolet radiation 
or X-rays a burst of light was obtained. Sustained enhancement of the 
emission was not observed with powder phosphors. In 1928, O. Lossew 
showed that steady emission of light could be obtained from certain silicon 
carbide crystals by the application of a D.C. field without previous excitation by 
ultra violet radiation. Whilst investigating these effects, G. Destriau obtained 
continuous emission of green light from a heavily copper-doped zinc oxysul- 
phide phosphor, by the application of an alternating low frequency field. Both 
the Lossew effect and the Destriau effect have been called electroluminescence. 
Following Ivey, the Lossew effect will be termed carrier injection electrolumi-
nescence, whilst the Destriau effect will be designated intrinsic or A.C. electro-
luminescence. Although this review is concerned with A.C. electroluminescence
a brief digression will allow the simpler case of carrier injection electrolumin-
escence to be described. Carrier injection electroluminescence has been
observed in other materials besides SiC, for example Ge, Si, and GaP. The
colour of the emission is usually that to be expected from a band-to-band transition
(BA, fig. 1). The electrodes in this case are in contact with the crystal and
it is assumed that a p-n junction effect is produced. Under the action of the
field, holes, say, are injected into the n-type region of the crystal where there is a
high concentration of electrons. Recombination between the free holes in the
valence band and the electrons in the conduction band causes the emission.
For a survey of this subject see the monograph by D. Curie.

Fig. 2. (a) Destriau’s Electroluminescent Cell. A. Glycerined salt solution as a trans-
parent top electrode. B. 0.001 in. thick mica sheet. C. Copper block as other
electrode. D. Phosphor powder suspended in castor oil. (b) Electroluminescent
Lamp. The thickness of the cell, which in reality is 0.010 in., has been greatly
exaggerated. E. Sheet of conducting glass. F. Phosphor suspended in plastic.
G. Barium titanate in plastic. H. Evaporated aluminium back electrode.

Intrinsic electroluminescence is more difficult to understand, as the electrodes
are not in contact with the phosphor. In Destriau’s cell (fig. 2a) the electro-
luminescent powder was suspended in castor oil and placed in a copper block.
A sheet of 0.001 in. mica was placed on the suspension. In order to observe
the light it is necessary to have a transparent top electrode. Destriau used a
conducting solution of glycerined salt for this purpose. Such a cell was
cumbersome and had no commercial possibilities. The electroluminescent
panel (fig. 2b), however, is robust and portable. The phosphor is suspended in a
thermosetting plastic of the acrylic group and applied to a sheet of conducting
glass. The plastic layer is of sufficient electrical strength for the field to be
applied across it directly, and there is no need to incorporate a mica sheet. In
order to obtain the maximum emission a layer of barium titanate of high
reflectance is put on the back of the phosphor layer, and a non-transparent
electrode of aluminium is evaporated onto the back. Electroluminescent
phosphors are very susceptible to the action of water vapour and, therefore,
the panel is waterproofed with wax.


4. PREPARATION AND PHYSICAL PROPERTIES OF ELECTROLUMINESCENT 
PHOSPHORS 


A.C. electroluminescence has been observed both in powders and single 
crystals. Only phosphors based on a II-VI host material appear to produce 
efficient electroluminescent phosphors and the present discussion will be 
confined to this class of materials. 

The first such phosphor, described in 1947, consisted of zinc sulphide to 
which 10-? gm Cu/gm ZnS had been added. This concentration of copper 
is an order of magnitude greater than that used in efficient photoluminescent 
phosphors of the same class. The dry powder was fired in a stream of argon 
at 1200°c and the brightness of the emission was greatly enhanced by 
thoroughly stirring the powder in air after firing and then re-firing at the same 
temperature; the optimum brightness was obtained after two such operations. 
This increase in efficiency could be associated with the inclusion of oxygen in 
the phosphor and, indeed, it was found that brighter phosphors could be 
prepared by the deliberate addition of zinc oxide to the initial mixture. The 
efficiency of these powders was, however, very low. In 1953, H. H. Homer,
R. Rulon and K. H. Butler prepared cubic phosphors of greatly increased
brightness compared to Destriau’s phosphor. The composition was different
for, in addition to copper ions and zinc oxide, a large quantity of lead ion was
added. An essential condition for the preparation of electroluminescent
phosphors of this type is that sufficient copper is added so that a sulphide
of copper precipitates onto the surface of the crystals of phosphor. The effect
of the lead addition is not fundamental. The lead concentration in the final
phosphor was negligible and the rôle of this ion appeared to be to remove excess
chlorine present in the original zinc sulphide by causing it to sublime as lead
chloride. Many phosphors containing zinc sulphide as the high temperature
modification (wurtzite) have been described. Trivalent aluminium ions and
monovalent halide ions have both been used as co-activators for copper-doped
hexagonal phosphors. P. Zalm has studied the conditions necessary for the
preparation of an efficient electroluminescent phosphor. The following criteria
arose from a study of zinc sulphide and zinc selenide phosphors coactivated with
chlorine and prepared in the wurtzite temperature range.
(1) The electron traps should be as shallow as possible, i.e. the choice of
co-activation is restricted to Cl⁻ or Al³⁺.
(2) The activation energy for the escape of holes from the ionized activator
levels must be as large as possible.
(3) The zinc sulphide must be present in the sphalerite form, i.e., it must be
annealed below 1000°c.


Electroluminescence has also been observed in single crystals of zinc sulphide.
To grow these crystals, zinc sulphide is sublimed at high temperature (~ 1250°c)
in a silica tube with a temperature gradient along the tube, either in vacuo or
with the help of an inert carrier gas such as argon. At a suitable temperature,
single crystals of zinc sulphide grow from the walls of the containing tube.
These crystals are of two types:

(1) Hexagonal rods in which the C-axis of the hexagonal modification lies
along the axis of the rod, or

(2) Flat platelets in which the C-axis is parallel to a side of the plate.

In certain cases the crystals exhibit one-dimensional stacking faults perpendicu-
lar to the C-axis, which lead to the inclusion of thin layers of the cubic modifica-
tion between the hexagonal material, which is present in a large excess. In
order to ‘dope’ them, the single crystals are immersed in a crucible of powder
containing the desired impurity at the required concentration and then heated
to high temperature for many hours, when the impurity diffuses homogeneously
into the single crystals.


5. ELECTRICAL AND OPTICAL PROPERTIES OF ELECTROLUMINESCENT
PHOSPHORS


In general, the properties of single crystals and powders of zinc sulphide
phosphors are identical. The site of emission is more easily investigated with
single crystals, but its exact location is still uncertain. Piper and Williams, for
example, observed that the emission occurred at the electrode which was
becoming negative, whereas P. Zalm showed that light appeared at localized spots
in the body of the crystal, which moved as the field changed polarity. On the
other hand, Tomlinson found that the emission from a zinc sulphide platelet
occurred in lines perpendicular to the C-axis in between the electrodes; these
lines of light were shown to coincide with areas of the crystal showing one-
dimensional disorder.




D. Frankl suggested that the apparent localization of the emitted light was
due to the very high refractive index of zinc sulphide. When the crystal was
immersed in a liquid of high refractive index the emission appeared to be more
generalized. In a later paper, however, he concluded that emission can occur at a
variety of sites. Many other conflicting reports have emerged in the past few years,
but most workers believe that emission only occurs at favoured centres in the crystal
and is not a bulk effect. This is significant, for although electroluminescence
of a crystal may occur at discrete points, the photoluminescent emission of the
same crystal will occur throughout the bulk.


6. BRIGHTNESS WAVES 


The brightness of electroluminescent emission changes with time, and this
variation is known as a brightness wave. For a sinusoidal exciting voltage the
brightness wave is modulated at twice the frequency of the applied field (fig.
3a). Each half-cycle of the brightness wave consists of two peaks: the primary
peak A leads the voltage, whilst the secondary peak B depends both upon the
voltage and frequency. Consecutive primary peaks of the brightness wave are of
different amplitude. This difference has been attributed to harmonics in the
exciting voltage waveform. Experiments showed that a 6 per cent harmonic content
in the applied waveform leads to a difference in peak height of 40 per cent
whereas this difference was negligible for a harmonic content of only 0.3 per
cent. The inequality in the peak height depends upon the nature of the backing
electrode H (fig. 2b), probably because the crystals only emit light at one
electrode. The asymmetry is due to self-absorption of the light by the phosphor
crystals, or to scattering of light by adjacent layers of powder
or by the individual particles. The shape of the brightness wave depends on
that of the exciting waveform (fig. 3b). Usually brightness waves can be
divided into two parts:

(a) An emission of light when the field increases from zero;

(b) A second burst of light when the field returns to zero.

Fig. 3. (a) Brightness wave for sinusoidal exciting voltage; (b) Brightness wave for square
wave D.C. pulses with transient pulse when field is off. [Axes: brightness, and in
(b) also volts, against time.]
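
The doubling of the modulation frequency can be seen in a toy model, sketched
below under a strong simplifying assumption: the instantaneous brightness is taken
to depend only on the magnitude of the applied voltage, through the law
B = a exp(-b/√V) quoted in section 7. The constants a and b and the function names
are illustrative. The sketch reproduces only the frequency doubling; it does not
reproduce the phase lead of the primary peak, the secondary peak, or the inequality
of consecutive peaks.

```python
import math


def instantaneous_brightness(voltage, a=1.0, b=50.0):
    """Toy emission law B = a * exp(-b / sqrt(|V|)); zero when V is zero."""
    magnitude = abs(voltage)
    if magnitude == 0.0:
        return 0.0
    return a * math.exp(-b / math.sqrt(magnitude))


def brightness_wave(peak_volts, freq_hz, n_samples=200):
    """Sample one full period of a sine-wave voltage and the resulting brightness."""
    period = 1.0 / freq_hz
    times = [i * period / n_samples for i in range(n_samples)]
    volts = [peak_volts * math.sin(2.0 * math.pi * freq_hz * t) for t in times]
    return times, [instantaneous_brightness(v) for v in volts]


if __name__ == "__main__":
    times, wave = brightness_wave(peak_volts=300.0, freq_hz=50.0)
    half = len(wave) // 2
    # The second half period repeats the first, so the brightness has period T/2.
    worst = max(abs(wave[i] - wave[i + half]) for i in range(half))
    print(f"largest difference between the two half periods: {worst:.2e}")
```

Because the model responds to the magnitude of the voltage and not to its sign, the
two half periods of the applied wave give the same light output.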


7. DEPENDENCE OF BRIGHTNESS ON VOLTAGE AND FREQUENCY 
Many equations have been proposed to account for the observed dependence
of electroluminescent brightness upon applied voltage. If the variation of
brightness is small a power law relation of the form

$$B = aV^{n}, \quad \text{where } 3 < n < 4,$$

adequately represents the voltage dependence.
G. Destriau originally proposed an equation,

$$B = a\exp(-b/V),$$

which was modified later, to account for a slight curvature in the plot, to

$$B = aV^{n}\exp(-b/V),$$

where a, b and n are constants, the value of n being between 1 and 3 but generally
2. If the range of observation only extends over one or two decades of bright-
ness the exact form of the equation is not critical. An equation which fits the
observations over nine decades for powder and crystal electroluminescent
phosphors is:

$$B = a\exp(-b/\sqrt{V}).$$

Fig. 4. (a) Variation of electroluminescent brightness with voltage (after Zalm). A.
Blue phosphor at 250 c/s. B. Orange phosphor at 1500 c/s. C. Green phosphor
at 50 c/s. (b) Variation of electroluminescent brightness with frequency for green
phosphor (after Zalm). A. Linear variation. B. 720 volts. C. 360 volts. D.
240 volts. E. 120 volts.
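
The empirical laws above are easily compared numerically. The sketch below is
illustrative only: it evaluates the three forms and recovers a and b for the last
law by fitting a straight line to ln B against V^(-1/2). The data are synthetic,
generated from the law itself with assumed constants (a = 10⁴, b = 120); they are
not measurements from the work cited, and all function names are hypothetical.

```python
import math


def power_law(volts, a, n):
    """B = a V^n."""
    return a * volts ** n


def destriau(volts, a, b, n=0.0):
    """B = a V^n exp(-b/V); n = 0 gives Destriau's original form."""
    return a * volts ** n * math.exp(-b / volts)


def root_law(volts, a, b):
    """B = a exp(-b / sqrt(V))."""
    return a * math.exp(-b / math.sqrt(volts))


def fit_root_law(volts, brightness):
    """Least-squares straight line through ln B against V^(-1/2); returns (a, b)."""
    xs = [1.0 / math.sqrt(v) for v in volts]
    ys = [math.log(value) for value in brightness]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope /= sum((x - mean_x) ** 2 for x in xs)
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope


if __name__ == "__main__":
    volts = [50.0, 100.0, 200.0, 400.0, 700.0]
    synthetic = [root_law(v, 1.0e4, 120.0) for v in volts]  # assumed constants
    a_fit, b_fit = fit_root_law(volts, synthetic)
    print(f"recovered a = {a_fit:.4g}, b = {b_fit:.4g}")
```

A plot of ln B against V^(-1/2) that is straight over many decades is the practical
test of the last law; over one or two decades the other forms fit about as well.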

The frequency dependence varies from phosphor to phosphor. In the 
case of electroluminescent panels this dependence of the phosphor may be 
obscured or complicated by variation in the dielectric properties of the plastic 
in which the phosphor is suspended. In general, phosphors with a single 
emission band increase in brightness linearly or sublinearly with frequency. At 
high frequency there is a tendency for the brightness to saturate and even go 
through a maximum for simple phosphors containing copper as activator. The 
brightness of certain phosphors containing Fe, Co and Ni in addition to the 
activator increases superlinearly with frequency. 

The variation of the colour of the emission of phosphors containing multiple 
activators with frequency is complicated. The colour of copper-lead phosphors 
with emission bands in the green and blue changes with frequency. The 
brightness of the green band becomes saturated as the frequency increases 
whereas the brightness of the blue band increases linearly. Thus at low 
frequencies the green band dominates the colour of emission, whereas at higher 
frequency the colour becomes blue. Similarly copper-manganese electro- 
luminescent phosphors have both orange and green emission bands. At 
low frequencies the orange band dominates the emission whereas at higher 
frequencies the contribution of the green band increases. Thus the colour of 
the emission changes from orange to green. 
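
The change of colour with frequency follows from the different frequency
dependence of the two bands. A minimal sketch for the copper-lead case is given
below. The saturating form chosen for the green band, the gains and the saturation
frequency are all assumptions made for illustration; only the qualitative
behaviour (green saturating, blue linear) is taken from the text.

```python
def green_band(freq_hz, gain=1.0, saturation_hz=500.0):
    """Green band: linear at low frequency, levelling off above saturation_hz."""
    return gain * freq_hz / (1.0 + freq_hz / saturation_hz)


def blue_band(freq_hz, gain=0.2):
    """Blue band: brightness rises linearly with frequency."""
    return gain * freq_hz


def blue_fraction(freq_hz):
    """Share of the total emission carried by the blue band."""
    blue = blue_band(freq_hz)
    return blue / (blue + green_band(freq_hz))


def crossover_frequency(green_gain=1.0, blue_gain=0.2, saturation_hz=500.0):
    """Frequency at which the two assumed bands are equally bright."""
    return saturation_hz * (green_gain / blue_gain - 1.0)


if __name__ == "__main__":
    for freq in (50.0, 500.0, 2000.0, 10000.0):
        print(f"{freq:8.0f} c/s  blue fraction = {blue_fraction(freq):.2f}")
    print(f"bands equal near {crossover_frequency():.0f} c/s")
```

With these assumed forms the blue share of the emission rises steadily with
frequency, so that the colour passes from green to blue.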


8. DEPENDENCE OF ELECTROLUMINESCENCE BRIGHTNESS ON TEMPERATURE 


The effect of temperature on the brightness of the emission of electro-
luminescent powder phosphors is difficult to measure, as the dielectric constant
of the plastic binder itself varies with temperature.
The effect is best observed with single crystals. A curve due
to G. F. Alfrey and J. B. Taylor given in fig. 5 for a zinc sulphide crystal 
which had no deliberately added impurities, is typical of the results obtained. 
At the temperature of liquid nitrogen the electroluminescent brightness is small. 
At higher temperatures (just below 0°c in this case) there is a very large increase 
in brightness. The brightness remains constant at higher temperatures al- 
though in some cases it decreases slightly. Repetition of the experiment at 
higher frequencies causes the step to move to higher temperatures and the 
extent of the shift is approximately proportional to the ratio of the two 
frequencies of the exciting field. 


9. THEORIES OF ELECTROLUMINESCENCE


The various mechanisms proposed all associate electroluminescence with a 
barrier region in the crystal since emission appears to occur at isolated spots. 
A simple explanation of the effect is that the field causes valence-to-conduction 
band transitions and that the light is emitted by the ordinary processes of
photoluminescence. But it has been estimated that for direct field excitation
of the centres in zinc sulphide a field of 10’ v/cm would be required, whilst the 
breakdown field for zinc sulphide would appear to be of the order 10°— 10° v/cm; 
so this mechanism is untenable. 


Fig. 5. Variation of electroluminescent brightness with temperature. A. 1000 c/s.
B. 2000 c/s. [Axes: log brightness against temperature.]


D. Curie proposed that electrons were transferred from donor levels
(T in fig. 1) into the conduction band either thermally or by the applied electric
field. The free electrons, after acceleration by the field, acquired energy in
excess of that necessary to ionize the luminescent centre by impact. Using the
kinetic-theory expression for the fraction of particles in a gas having a path length
in excess of a certain given value, it is possible to derive an expression for the
brightness voltage dependence of electroluminescence of the form originally
suggested empirically by Destriau, i.e.

$$B = aV^{n}\exp(-b/V).$$

Curie did not attempt a quantitative discussion of the factor $aV^{n}$ but considered
it to be proportional to the number of free carriers. Destriau pointed out that,
if acceleration over a certain distance is necessary for an electron to gain
sufficient energy to ionize the luminescent centres, a magnetic field at right
angles to the electric vector of the applied field should quench electro-
luminescence. A. N. Ince applied a magnetic field of 120 000 oersteds to an
electroluminescent cell but no such effect was observed. There are two
possible explanations of this negative result; either the field in the crystal far
exceeds the applied field, or (less likely) the mass of the electron in the crystal
differs greatly from the mass of the electron at rest. These results invalidate
the acceleration theory given above, but suggest that the field is concentrated
at some barrier in the crystal (fig. 6). Electroluminescence is thought to occur
in a Mott-Schottky exhaustion barrier formed at the junction between an
electron-rich layer such as a metal and a semiconductor. The positive space-
charge found in the region A concentrates the field across this region. Electrons
injected from C or from the traps D are accelerated sufficiently by the very high
field in the crystal to ionize the luminescent centres B by impact. Recombina-
tion of the free electron in the conduction band and the hole in the ionized centre
gives rise to emission. The theories may be divided into two general classes.

(a) Those which consider the field ionization of the electrons in the donor 
levels in the barrier region to be the rate determining step (e.g. Piper and 
Williams, Alfrey and Taylor, Zalm, etc.) 

(b) Those which consider that the electrons released by field ionization are
trapped outside the barrier region in the bulk of the crystal. The rate of return 
of these electrons to the ionized luminescent centres in the barrier region is 
the determining factor (e.g. Haake, Morehead, etc.) 
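
The bearing of Ince's negative result on the field in the crystal can be
illustrated by a rough estimate. An electron starting from rest in crossed
electric and magnetic fields, and suffering no collisions, describes a cycloid and
reaches a kinetic energy of at most 2m(E/B)². The sketch below evaluates this for
the free-electron mass and a flux density of 12 tesla (120 000 oersteds in free
space). The two electric fields used, 10⁴ v/cm for the applied field and 10⁶ v/cm
for a barrier field, are assumed round values and not figures from this article;
collisions and the effective mass in the crystal are ignored.

```python
ELECTRON_MASS_KG = 9.109e-31
ELEMENTARY_CHARGE_C = 1.602e-19


def max_energy_ev(e_field_v_per_m, b_field_tesla, mass_kg=ELECTRON_MASS_KG):
    """Peak kinetic energy (eV) on a cycloid started from rest: 2 m (E/B)^2."""
    drift_speed = e_field_v_per_m / b_field_tesla
    return 2.0 * mass_kg * drift_speed ** 2 / ELEMENTARY_CHARGE_C


if __name__ == "__main__":
    flux_density = 12.0  # tesla, i.e. 120 000 oersteds in free space
    # Assumed, illustrative field strengths (1 v/cm = 100 V/m).
    for label, field_v_per_cm in (("applied field", 1.0e4), ("barrier field", 1.0e6)):
        energy = max_energy_ev(field_v_per_cm * 100.0, flux_density)
        print(f"{label}: E = {field_v_per_cm:.0e} v/cm, peak energy = {energy:.3g} eV")
```

On these assumptions an electron moving in the applied field alone could never
reach the few electron-volts needed for ionization, so that quenching would be
expected, whereas in a field a hundred times larger the magnetic limit is of no
consequence. This is the sense in which the absence of quenching points to a field
concentrated at a barrier.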


Fig. 6. Exhaustion barrier model for electroluminescence by the impact ionization 
mechanism. A. Luminescent centres. B. The potential due to the applied field. 
C. Electrons from the barrier region removed to equalize the Fermi levels of the 
two materials. D. Traps. E. Mott-Schottky exhaustion barrier. F. Electron 
acceleration to attain sufficient energy to ionize the centres. 


Both cases give a relationship between brightness and voltage of the form

$$B = a\exp(-b/\sqrt{V}).$$

It might be expected that the total brightness would be directly proportional 
to the number of times that the voltage reaches a maximum per sec (that is, to 
the number of peaks per sec in the brightness wave). The saturation of 
brightness at higher frequencies must be attributed to the effect of electron traps. 
If the mechanism of electroluminescence is in some way dependent upon the 
rate of de-trapping of the electrons, then it might be expected to saturate when 
the time that an electron remains trapped becomes comparable to the period of 
the applied voltage. The decay constant indicates that the trapping time will 
be milliseconds or less, corresponding to a saturation of brightness at frequencies 
of kilocycles or higher. 

A qualitative explanation of the temperature dependence of electrolumin-
escence can be offered for this model, if the rate of return of electrons to the
barrier is the rate determining step. At low temperatures (77°K) the return of
electrons from the bulk of the crystal to the exhaustion barrier is sustained by
impact rather than thermal detrapping, giving a small emission. At low to moderate
temperatures the barrier field is relatively unaffected by thermal effects and the
brightness increases with temperature because of the increasing number of
electrons which can return to the barrier from the bulk of the crystal in one half
cycle (because more electrons return to the conduction band). At high
temperatures the positive holes escaping from the ionized activators diffuse
from the barrier, thus reducing the rate of recombination and the electro-
luminescent brightness. Williams has offered a more quantitative explanation
based on the field ionization theory. It would appear, therefore, that the
mechanism of electroluminescence is well understood and that theory and ex-
periment are in complete accord. But the following section offers certain
evidence which appears to be at variance with accepted theory.


10. OBJECTIONS TO THE PRESENT STATE OF THE THEORY 


The impact ionization mechanism implies a threshold for electroluminescence,
since ionization cannot occur until the electron in the conduction band has
acquired an energy equal to that necessary to raise an electron from the centre
into the conduction band. Destriau reported that the emission of an electro-
luminescent phosphor could be detected down to levels 1/100 of that perceptible
to the dark-adapted eye and that there appeared to be no threshold. Thornton
investigated the electroluminescence of a thin evaporated film of zinc sulphide
and found that electroluminescence still occurred when the applied voltage was
reduced to less than half the band gap, although the electron cannot then obtain
sufficient energy from the field; this suggests that the electroluminescent
process does not depend upon impact ionization of the luminescent centres. Pátek
assumed that the electrons have a Maxwellian distribution of thermal energies so
that sufficient energy can be obtained from the field by an electron of high
thermal energy, that is, the electron interacts with both a phonon and the electric
field. No test of this theory has been made, but it implies that the plot of
electroluminescent brightness against voltage must be temperature sensitive at the
lower voltage end.

Goldschmidt has found that the intensity of emission of zinc sulphide powder, 
gradually eaten away by acid, did not decrease until the particles became 
exceedingly small. An electron-rich layer on the surface of the grains does not 
appear to be necessary; it is difficult to see how the brightness could remain 
constant as the size of the grain is greatly decreased unless electroluminescence 
occurs throughout the bulk. 

In a study of the correlation of electroluminescence and crystal structure, 
McKeag and Stewart showed that the electroluminescence emission of phosphors
prepared above the transition temperature was greatly increased by annealing 
below that temperature. The greatest effect was obtained by annealing at 
750°c when x-ray diffraction showed the powders to be mixtures of cubic and 
hexagonal zinc sulphide. 

Similarly the emission from single crystals was shown to occur in lines
parallel to birefringent-grey bands in the platelet. These bands were due to
hexagonal material being in contact with material with a one-dimensional stacking
fault. Ballentyne investigated the effect of copper added to zinc sulphide
phosphors and showed that electroluminescent phosphors appeared to be a
mixture of the two phases. The precipitation of copper in a phosphor prepared
in the wurtzite range led to its conversion to the sphalerite form. More
significantly a phosphor was prepared which did not contain an electron-rich
material such as copper or silver but which was electroluminescent. This
phosphor was a mixture of cubic and hexagonal zinc sulphide and the method
of preparation was such that a considerable number of cation vacancies may
have been generated.

The difference in band gap between hexagonal and cubic zinc sulphide is so 
small that it is hard to find sufficient energy to initiate electroluminescence at a 
barrier between the two materials. Yet some such mechanism may exist 
because W. J. Merz has observed very large photovoltaic effects in zinc sulphide
crystals containing one-dimensional stacking faults. These voltages may be a
hundred times the band gap of zinc sulphide, and are so far unexplained. It 
is possible that a similar mechanism operates in the case of electroluminescent 
zinc sulphide phosphors and that the energy to ionize the luminescent centres is 
associated with a stacking fault. 


11. APPLICATIONS 


Electroluminescent lamps could revolutionize the lighting industry, as 
large-area capacitative light sources would be ideal for illuminating factories 
and offices. Such lights would cast no shadows. But the brightness of present 
day phosphors is too low by an order of magnitude for this purpose. 

During recent years attempts have been made to decrease the size of the 
cathode ray tube in a television set and nowadays the 110° tube is commonly 
employed in commercial sets. Electroluminescence could be used in a solid 
state television display unit that would not be very thick and which could be 
hung on the wall like a picture. If, in fig. 2b, the front and back electrodes 
are replaced by two orthogonal grids, then a spot of light could be made to 
appear on the panel at the intersection of two wires of the grid by applying a 
potential to the respective electrodes. Scanning such a panel requires only that 
the voltage can be switched to the correct pair of wires. A number of cumber- 
some systems have been developed but in the last year or so elegant methods 
using ferroelectric and ferromagnetic materials have been proposed. The 
solid state television tube may well be a reality within the next five years. 
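
The crossed-grid arrangement can be sketched as follows. The drive scheme used
here, with the selected row wire at +V/2, the selected column wire at -V/2 and all
other wires at zero, is an assumption made for illustration and is not described
in this article; the brightness law is that of section 7 with assumed constants,
and all names are hypothetical.

```python
import math


def cell_brightness(volts, a=1.0, b=120.0):
    """Assumed emission law B = a * exp(-b / sqrt(V)) for a single cell."""
    if volts <= 0.0:
        return 0.0
    return a * math.exp(-b / math.sqrt(volts))


def panel_voltages(rows, cols, selected_row, selected_col, drive_volts):
    """Voltage amplitude across every cell of the grid for one selected crossing."""
    grid = []
    for r in range(rows):
        row_potential = drive_volts / 2.0 if r == selected_row else 0.0
        line = []
        for c in range(cols):
            col_potential = -drive_volts / 2.0 if c == selected_col else 0.0
            line.append(row_potential - col_potential)
        grid.append(line)
    return grid


def raster_scan(rows, cols):
    """Yield the (row, column) pairs in the order of a simple raster."""
    for r in range(rows):
        for c in range(cols):
            yield r, c


if __name__ == "__main__":
    grid = panel_voltages(rows=3, cols=4, selected_row=1, selected_col=2, drive_volts=600.0)
    full = cell_brightness(grid[1][2])   # cell at the selected crossing
    half = cell_brightness(grid[1][0])   # cell sharing only the selected row
    print(f"contrast between selected and half-selected cell: {full / half:.1f}")
```

Cells lying along the selected wires see half the voltage and therefore glow
faintly; the steepness of the brightness-voltage law determines how much contrast
such a panel can give.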

The image on the fluoroscope screen during diagnosis by x-ray fluoroscopy
is very faint; indeed, the radiologist needs to be dark-adapted to examine
detail. The brightness of the picture cannot be increased by increasing the
x-ray intensity as the maximum dose of x-rays a patient may have is limited.
Complicated and expensive electronic devices using what is effectively a closed 
circuit television system have been constructed to intensify the picture. Electro- 
luminescence offers a cheaper and simpler method of accomplishing the same 
end. Imagine an electroluminescent panel and a photoconducting cell in 
series. In the dark the impedance of the photoconductor is large and an applied 
A.C. field will be totally dropped across it. The panel cannot, therefore, light 
up. If x-rays or light are allowed to fall on the photoconductor its impedance 
will decrease and the panel can emit light; the intensity of the emission depend- 
ing upon the impedance of the photoconductor and, hence, upon the intensity 
of the incident radiation. By choosing the correct impedances for panel and 
photoconductor such a device can be made to act as a radiation amplifier. A 
panel consisting of a number of elements of the type described above can be
used as an image intensifier to replace the fluoroscope screen at present used in
x-ray fluoroscopy.
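
The action of a single element can be sketched as a potential divider. In the
sketch below the two impedances are treated as simple in-phase magnitudes, which
ignores the capacitative nature of the panel; the response assumed for the
photoconductor, the constants and the function names are all illustrative
assumptions.

```python
import math


def panel_voltage(supply_volts, z_panel_ohm, z_photo_ohm):
    """Share of the supply voltage appearing across the panel in the series circuit."""
    return supply_volts * z_panel_ohm / (z_panel_ohm + z_photo_ohm)


def photoconductor_impedance(intensity, z_dark_ohm=1.0e9, sensitivity=1.0e3):
    """Hypothetical response: the impedance falls as the incident intensity rises."""
    return z_dark_ohm / (1.0 + sensitivity * intensity)


def panel_brightness(volts, a=1.0, b=120.0):
    """Assumed emission law B = a * exp(-b / sqrt(V))."""
    if volts <= 0.0:
        return 0.0
    return a * math.exp(-b / math.sqrt(volts))


if __name__ == "__main__":
    supply, z_panel = 600.0, 1.0e7
    for intensity in (0.0, 0.01, 0.1, 1.0):  # arbitrary units
        volts = panel_voltage(supply, z_panel, photoconductor_impedance(intensity))
        print(f"intensity {intensity:5.2f}: panel volts = {volts:6.1f}, "
              f"brightness = {panel_brightness(volts):.3g}")
```

In the dark nearly all the voltage falls across the photoconductor; as the incident
intensity rises the panel voltage, and with it the emission, increases steeply.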




Although electroluminescence has only been investigated for a decade much 
progress has been made. It is possible that the present researches which are 
basically fundamental in nature will lead to the formulation of new and more 
efficient electroluminescent phosphors which will make possible the construction 
of devices that will affect the everyday life of the whole community. 


BIBLIOGRAPHY 


Luminescence 
G. F. J. Garlick, Handbuch der Physik, Vol. 26 (1958).
H. W. Leverenz, Introduction to the luminescence of solids (New York: 1956).


Electroluminescence


General
G. Destriau, Phil. Mag., 38, 700 (1947).
P. Zalm, Philips Res. Reports 11, 353-399 (1956); 11, 417-451 (1956).


Preparation and Properties
H. H. Homer, R. Rulon, K. H. Butler, J. Electrochem. Soc. 100, 566-571 (1953).
H. C. Froelich, J. Electrochem. Soc. 100, 280-288, 496-507 (1953).
H. F. Ivey and G. Destriau, Proc. I.R.E., 1911 (1956).
A. H. McKeag and E. G. Stewart, J. Electrochem. Soc., 104, 41 (1957).
D. W. G. Ballentyne, J. Electrochem. Soc., 107, 807 (1960).


Theoretical
D. Curie, Progress in Semiconductors 2, 251 (1957).
W. W. Piper and F. E. Williams, Brit. J. Appl. Phys., 4, 39-44 (1954).
P. D. Johnson, W. W. Piper and F. E. Williams, J. Electrochem. Soc., 103, 221
(1956).
G. F. Alfrey and J. B. Taylor, Proc. Phys. Soc. B., 78, 775-784 (1955).
C. H. Haake, J. Electrochem. Soc., 104, 291 (1957).
E. F. Morehead, J. Electrochem. Soc., 105, 461 (1958).


The Author: 


Dr. D. W. G. Ballentyne is a graduate of London and Exeter Universities. He has 
worked on electroluminescence and photoconduction for the Marconi W.T. Co. Ltd. and 
A.E.I. before joining Hull University. He has published a number of papers on this 
and related subjects. 


Some Exhibits at The Royal Society Conversazione, May, 1961 


1. Determination of Wood Density Variation across Tree Growth Rings 
by means of β-particle Absorption 


(Forest Products Research Laboratory and Radioisotope Applications 
Unit of the D.S.I.R.) 


The density of timber is an index of its physical properties and of its quality 
(fig. 1). Conventional density measurements on small specimens demand both 
high precision and time. The method shown uses specimens about 100 μ thick, 
cut from increment cores of 11 mm diameter; it records the density along the 
length of the strip and follows the change, in the annual growth ring, from the low 
density spring (early) wood to the higher density summer (late) wood (fig. 2). 


[Figure: points for air-dry and green wood plotted against density (0 to 60 lb/ft³).] 

Fig. 1 (a). The relation between the bending strength and the density of wood. 




A finely collimated beam of β-particles, from C¹⁴, scans the strip continuously. 
The absorption depends upon the density and the thickness of the 
specimen and so if specimens of the same thickness are used the absorption is 
a measure of the density. 


[Figure: dielectric constant plotted against density (0 to 60 lb/ft³); ○ = 1 Mc/s, × = 100 Mc/s.] 


Fig. 1 (b). The relation between the dielectric constant and the density of wood. 


Each β-particle which passes through the specimen produces a flash of light 
in the anthracene crystal; a photomultiplier and an electronic amplifier magnify 
the output pulses, whose rate is then registered on a counter (fig. 3). The 
variation of density in the specimen as it is moved through the beam is finally 
recorded on a graph. The apparatus is calibrated using wood of known density. 
Comparison of the results found by this method with densities determined by 
the weighing method showed that in most cases the difference between them 
did not exceed 3 per cent of their mean. The time required to traverse an 
annual growth ring of 5 mm width is about 12 min. 
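
The calibration step can be sketched as follows. This is an illustration under stated assumptions, not the exhibitors' procedure: it assumes the count rate falls off exponentially with the product of density and thickness (a common approximation for β-particle absorption), it assumes all strips have the same thickness, and the count rates and reference density are hypothetical numbers.

```python
import math


def calibrate(rate_open_beam, rate_reference, density_reference):
    """Return k (absorption coefficient times thickness) from a reference strip.

    Assumes rate = rate_open_beam * exp(-k * density) for strips of one thickness.
    """
    return math.log(rate_open_beam / rate_reference) / density_reference


def density_from_rate(rate, rate_open_beam, k):
    """Invert the assumed exponential law to recover density from a count rate."""
    return math.log(rate_open_beam / rate) / k


# Hypothetical calibration: wood of density 0.50 g/cm^3 cuts 1000 counts/s to 600.
k = calibrate(1000.0, 600.0, 0.50)

# A lower count rate at another point on the strip indicates denser (late) wood.
print(f"density at 450 counts/s: {density_from_rate(450.0, 1000.0, k):.2f} g/cm^3")

# Scanning speed implied by the figures in the text: 5 mm in about 12 min.
print(f"scan speed: {5.0 / 12.0:.2f} mm/min")
```

Because only ratios of count rates enter, the absolute source strength drops out once the open-beam rate has been measured.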




Fig. 3. Density determination apparatus. 




2. Kinetic Gas Analysis 
(Dr. B. M. Wright, National Institute for Medical Research, London.) 


Kinetic gas analysis is a method depending on the velocity of sound in a 
gas and its viscosity. It was first recognized as a possibility in Germany in 
1920, probably first used in its present form in 1949 in the U.S.A., under the 
name of ‘viscosity effusion’, and recently reinvented quite independently and 
almost simultaneously in the U.S.A. and in this country in 1959. 

When a gas is drawn through an orifice the rate of flow attains a limiting 
value when the velocity of gas in the orifice reaches the velocity of sound. 
This arises because the velocity of sound is the rate of propagation, through 
the medium, of pressure change and therefore a change of pressure on the low 
pressure side cannot be propagated upstream when the rate of gas flow is equal 
to the velocity of sound in it. Thus the volumetric flow rate through an 
orifice can be used as a direct measure of the velocity of sound in the gas. The 
flow rate is measured by the pressure drop across a capillary tube which depends 
upon the viscosity of the gas but is independent of its density. 

The analyser (fig. 4) consists of a jet, A, a capillary tube, C, an alcohol- 
mercury U-tube manometer, D, and a connecting chamber, B. 


Fig. 4. Kinetic gas analyser. 


When the ratio of the pressures on the two sides of A is about 2 the steady 
state is reached and the manometer shows the fall in pressure along the capillary 
tube, which is proportional to the product of the volume rate of flow and the 
coefficient of viscosity. The change in pressure arising in response to a change 
in the properties of the gas fed into the instrument has two components, i.e. 
that due to the viscosity change and that due to the velocity of sound change; in 
many cases these two components are of the same sign but even when they are 
of different signs the latter is so much greater than the former that useful 
readings are obtained. 
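
The sound-velocity component alone can be estimated from ideal-gas relations. The sketch below is a rough illustration, not a model of the instrument: it ignores the viscosity component entirely, treats each gas as ideal with a constant specific heat (the values listed are textbook approximations supplied here, not taken from the report), and should not be expected to reproduce the manometer sensitivities quoted for the glass instrument.

```python
import math

R_GAS = 8.314  # J/(mol K)

# Approximate molar mass (kg/mol) and molar specific heat Cv in units of R.
# These are assumed textbook values, not figures from the report.
GASES = {
    "air": (0.02897, 2.5),
    "helium": (0.004003, 1.5),
    "carbon dioxide": (0.04401, 3.4),
}


def sound_speed(mole_fractions, temperature=293.0):
    """Ideal-gas sound speed (m/s) of a mixture given {gas: mole fraction}."""
    molar_mass = sum(x * GASES[g][0] for g, x in mole_fractions.items())
    cv = sum(x * GASES[g][1] for g, x in mole_fractions.items())
    gamma = (cv + 1.0) / cv  # Cp = Cv + R for an ideal gas
    return math.sqrt(gamma * R_GAS * temperature / molar_mass)


def fractional_change(contaminant, fraction=0.01):
    """Fractional change in sound speed when air carries a little contaminant."""
    c_air = sound_speed({"air": 1.0})
    c_mix = sound_speed({"air": 1.0 - fraction, contaminant: fraction})
    return c_mix / c_air - 1.0


for gas in ("helium", "carbon dioxide"):
    print(f"1 per cent {gas}: {100.0 * fractional_change(gas):+.2f} per cent")
```

On these assumptions a light gas such as helium raises the velocity of sound and a heavy gas such as carbon dioxide lowers it, so the sign of the manometer change distinguishes the two cases.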

In practice air is used as the reference gas and then changes in the pressure 
arising when contaminated air is fed in are measures of the degree of con- 
tamination. 




An instrument made of glass has the following characteristics: flow rate 
100 ml/min, vacuum 600 mm of mercury, response time 10 sec; changes in 
pressure arising from a 1 per cent contamination of air with carbon dioxide 
1 cm, with helium 12 cm, with fluothane 10 cm. A demonstration using air with about 
20 per cent helium showed the quick response and the steady reading of the 
instrument. The sensitivity is such that the alcohol-mercury U-tube, or the 
aneroid type of manometer, may be used; electronic amplification is necessary 
only for gases such as oxygen, to which the instrument is less sensitive, or 
where continuous recording is desirable. 


A carbon dioxide analyser is being used by the Himalayan Scientific and 
Mountaineering Expedition operating at 20 000 feet. 


3. Graphite 


(Professor A. R. Ubbelohde, F.R.S., Department of Chemical Engineering and Chemical 
Technology, Imperial College, London; C. A. Parsons & Co., Ltd., Nuclear Research 
Centre, Fossway, Newcastle on Tyne; and Hawker-Siddeley Nuclear Power Co. Ltd., 
Langley, Buckinghamshire.) 


In the early days of fission reactors it was known that as a moderator heavy 
water would give the greatest neutron economy; beryllium, or its oxide, was a 
possible moderator but it was difficult to handle because of toxic effects; benzene 
ring moderators were known but there was some doubt about some of their 
properties; demineralized light water demanded the use of an enriched fuel 
which at that time was not available. It emerged therefore that the nuclear 
power programme must be based on natural uranium with graphite as the 
moderator. Natural flaked graphite is too weak in structure and too chemically 
impure to be used in reactors; so the science and technology of synthetic 
graphite (made from high purity petroleum coke and high purity low ash coal 
based binder pitch) became, and still remains, a matter of national importance. 
In 1956 at Calder Hall and in 1959 at Chapelcross, graphite moderated Magnox 
reactors came into action, and in 1959, when the OEEC High Temperature 
Reactor Project (DRAGON) was initiated, the system chosen for study used 
graphite as the moderator. 


The exhibit showed the extensive range of the work being done on graphite. 
There were diagrams, photographs, various objects in polycrystalline graphite, 
and a model showing the interlocking structure of the synthetic graphite which 
is strong in comparison with the laminar natural graphite. The effect of treating 
with bromine was shown by applying the same potential difference to two 
identical rods of graphite, one of which had been bromine treated, with a lamp 
in series with each; the lamp in series with the treated specimen was remarkably 
brighter than the other; the increase in electrical conductivity is near tenfold. 
The explanation given was that the bromine abstracts electrons from the graphite 
leaving positive holes which facilitate conduction. 

Potassium graphite, C₂₄K, was on view; it is blue, and the addition of more 
potassium gives rise to a golden compound. 




4. A Propane-filled Bubble Chamber 


(Dr. J. McKenzie and Mr. M. R. Bowman, Clarendon Laboratory, Oxford.) 


The volume of the chamber (fig. 5) is 1 litre, the operating pressure 350 
lb/in², and an expansion of 3 per cent within about 10 msec is produced by 
coupling the piston to the flywheel of a small electric motor by a magnetic 
clutch. This operation requires a square-shaped current pulse of amplitude 
4 amp lasting between 20 and 100 msec. A typical trace of the chamber 
pressure is shown in fig. 6. 


[Figure: section through the chamber, labelled: lens, liquid propane, glass plates, expansion diaphragm, oil, piston, to magnetic clutch, pressure reservoir.] 

Fig. 5. The chamber, cylinder and operating shaft. 

When the clutch is energized, a synchronized flash illuminates tracks produced 
by the ionizing particles. 

The assembly was originally designed to test the magnetic clutch method of 
producing the expansion and when it was found to be effective the chamber 
was exposed to protons of about 80 Mev to investigate the reaction C¹² (p, 2p) B¹¹. 


Fig. 6. A typical trace of the chamber pressure. 


Fig. 7. Protons of 80 Mev incident energy, one of which has interacted in the 
propane of the chamber (either a p-p collision or a C¹² (p, 2p) B¹¹ reaction). 


About 20 000 expansions have been photographed. Examples are shown in 
fig. 7 and fig. 8. 

The bubble chamber was invented in 1952 by D. A. Glaser, of Michigan 
University, following the inspiration he derived from the contemplation of a glass 
of beer. He was a Nobel Prizewinner in 1960. 

The first chamber was simple and small and was intended to replace 
the elaborate complexity and large size of high-pressure cloud chambers. However, 
development of bubble chambers was even more rapid and far-reaching 
than that of cloud chambers, and by 1959 the group under Alvarez at Berkeley, California, 
had produced a “coffin chamber” using over 500 litres of liquid hydrogen. 
The effort involved was about 65 man-years, and the cost about $2 000 000, of 
which one-tenth went to providing the magnet. 


Fig. 8. Compton electrons produced by γ-rays from Co. 




Towards the end of this year, a British National liquid hydrogen chamber 
will be completed; it is the result of a collaboration between the Universities of 
London (Imperial College), Liverpool, Birmingham and Oxford, under the 
direction of Professor C. C. Butler. 

The most widely used liquid is liquid hydrogen, since its ideal properties as a 
target for high-energy nuclear physics outweigh the severe technological difficul- 
ties. A wide range of other liquids has been made to work in bubble chambers, 
where special characteristics are suited to particular needs, e.g. deuterium, 
helium, propane, tungsten hexafluoride, xenon, etc. 

Since bubble chambers can expand every second or two, they can produce 
about a million sets of photographs per year. Hence the need for elaborate 
automatic machinery for measurement of data, and high-speed electronic 
computers for lengthy calculations. 
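
The arithmetic behind these figures is easily checked. The short sketch below assumes one set of photographs per expansion (an assumption; the text does not say so explicitly) and asks how long a chamber must actually run to yield a million sets.

```python
SECONDS_PER_DAY = 86_400


def days_of_running(photograph_sets, seconds_per_expansion):
    """Continuous running time, in days, needed to record the given number of sets."""
    return photograph_sets * seconds_per_expansion / SECONDS_PER_DAY


# One expansion every second or two, as quoted in the text.
for cycle in (1.0, 2.0):
    days = days_of_running(1_000_000, cycle)
    print(f"{cycle:.0f} s per expansion: {days:.1f} days of continuous running")
```

On this reckoning a million sets correspond to a few weeks of continuous expansion, so the yearly total quoted is compatible with a chamber that operates for only part of the year.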


The National Physical Laboratory: Solids at Extremely 
High Pressures 


A new item on the research programme of the Basic Physics Division of the 
NPL is the study of the physical properties of solids at extremely high pressures 
of the order of 10⁵ atm, with temperatures ranging from −200°C to 2000°C. 

The specimen is embedded in a tetrahedron of pyrophyllite, the four faces of 
which are squeezed uniformly by an anvil pressing on each face; the pyrophyllite 
flows and applies a pseudo-hydrostatic pressure to the specimen, and it also 
provides thermal and electrical insulation between the anvils. Three anvils 
are seated inside a wedge cone of gun barrel steel (fig. 2) and the tetrahedron 
is held, point downwards, between them. The fourth anvil, applied to the 
horizontal face of the tetrahedron, is pushed vertically downwards by a simple 
mechanical press. The anvils in the cone are then pressed down along the 
curved surface of the cone, and so the tetrahedron is squeezed. Between each 
of the three anvils and the cone face a sheet of Teflon acts as a lubricant and a 
sheet of Terylene as an insulator should the Teflon break down. 


Fig. 1. Pyrophyllite tetrahedron. 

1. Tetrahedron of pyrophyllite; 
2. The tetrahedron cut and bored to receive the specimen; 
3. Sleeve to contain the specimen; 
4. The specimen; 
5. Metal foil to be placed on the ends of the specimen and bent over as shown in 7, one end 
emerging on each face to make electrical contact with the anvil; 
6. Wedge shaped fillers to restore the tetrahedron; 
7. The tetrahedron ready for use; 
8. After use. 




Fig. 2. A. Test sample in pyrophyllite tetrahedron ; B. Loading anvils (tungsten carbide); 
C. Cone wedge. 


The high temperatures are achieved by electrical heating of an inner cell 
which contains the specimen, and the low temperatures by circulating a cryo- 
genic liquid, such as liquid nitrogen, round the loading anvils. Connections 
for electrical heating and resistivity measurements are made by way of the three 
lower anvils and the loaded anvil, which is earthed, in contact with the metal 
strips. 

The equipment is used regularly up to 80 000 atm and has been calibrated 
using P. W. Bridgman’s fixed point resistance data for bismuth, thallium and 
barium (1). 

The study of the variation of resistivity of semi-conductors with pressures 
up to 70 000 atm and temperatures from 120°K to 600°K has been carried out. 
The behaviour of a single crystal specimen of indium antimonide (2) is shown in 
fig. 4; the sharp fall in resistivity at about 30 000 atm is attributed to the phase 
change (believed to be melting) when the substance becomes essentially a 
metallic conductor. 


Fig. 3. Components of tetrahedral ultra high pressure apparatus. 


[Figure: resistivity (ohm cm) plotted against pressure (10 000 to 80 000 atmospheres) for n-type indium antimonide, about 4 × 10¹⁵ electrons/cc; ○ temperature 293°K, × temperature 164°K.] 


Fig. 4. Behaviour of resistivity of indium antimonide at high pressures. 


It has been found by Bridgman (3), the pioneer of high pressure measurements, 
that for mechanically hard, high melting point metals the pressure 
coefficient of resistivity (1/R)(dR/dp) is of the order of −10⁻⁶, for some of the 
softer metals it is ten times as great and still negative, whilst for some substances, 
e.g. antimony, bismuth, aluminium it is of the order of +10⁻⁵; there is no 
‘normal behaviour’. 


High pressures afford the conditions for synthesizing new materials. In 
favourable cases, the simultaneous application of high temperatures and pressures 
can produce a rearrangement of the atoms which remains ‘frozen in’ when 
conditions revert to normal, thus giving a new substance. Important examples 
of this are the formation of diamond from graphite, borazon from boron nitride, 
and coesite from quartz. 


As part of the development programme at NPL both coesite and diamond 
have been synthesized. The conditions required for the formation of diamond 
when nickel is used as a catalyst are 80 000 atm and 2000°C. Although diamond 
was first synthesized in the U.S.A. about six years ago this is the first occasion 
on which it has been made in the United Kingdom. 

V. T. S. 


REFERENCES 


(1) Bridgman, P. W., 1952, Proc. Amer. Acad. Arts & Sci., 81, 165. 
(2) Nature, Lond., 1960, 188, 1095. 
(3) Bridgman, P. W., 1958, The Physics of High Pressure (London: Bell & Son Ltd.). 


HISTORY OF SCIENCE 


This new annual review of literature, research and teaching in the History of Science 
is edited by A. C. Crombie (Oxford) and M. A. Hoskin (Cambridge), and published at 
Cambridge by W. Heffer & Sons Ltd. The Advisory Editors are G. Buchdahl, Herbert 
Butterfield, F. N. L. Poynter and W. P. D. Wightman. The review will contain 
critical essays, book reviews, surveys of articles of special interest published in other 
journals, information about research in progress, and other matters relating to the 
professional development of the subject. The first number will be published early in 
1962. 


The Institute of Physics and the Physical Society 
Conference on Nuclear Physics 


Birmingham, 17th-19th April, 1961 


This Conference, sponsored by the Institute of Physics and the Physical 
Society and arranged by the Departments of Physics and Mathematical Physics 
of the University of Birmingham, continued the series initiated by the Physical 
Society in 1957. Usually two such conferences have been held each year. 

The Conference programme included selected topics from both high and 
low energy nuclear physics and, in addition, sessions on γ-ray physics. Detailed 
questions of experimental or theoretical technique were discussed separately in 
parallel sessions on the last day, but otherwise the Conference met as a whole. 
The wide range of topics covered in the Conference reflected the belief of the 
organizers that it is desirable for people to hear about work outside their own 
specialities. 

A pleasant feature of previous conferences has been the presentation of 
certain of the Physical Society’s annual awards and this practice was continued 
at Birmingham. The President, Sir John Cockcroft, presented the Duddell 
Medal to J. B. Adams for his work on the design and construction of the 
CERN proton synchrotron; A. W. Merrison of the University of Liverpool 
received the C. V. Boys Prize for his work in high energy physics. During the 
Conference, the recipients each delivered an address based on his work. 

The sessions on γ-ray physics were mainly concerned with the Mössbauer 
effect (the recoilless emission of γ-rays), which provides sources of radiation 
extremely well-defined in energy. The subsequent resonant scattering of this 
radiation allows small energy changes of order 1 in 10¹² to be detected. This 
effect permits the determination of nuclear constants in a few favourable cases, 
but it has many applications to problems in solid state physics; the shift in 
nuclear energy levels depends on the environment of the nucleus in the crystal 
lattice. This technique has already yielded some surprises. A notable example 
discussed was the determination of the magnetic field at the nucleus due to 
the electrons. The experimental results show clearly that earlier theoretical 
estimates were far from correct. It was also emphasized that the Mössbauer 
effect was in many respects complementary to nuclear magnetic resonance 
techniques. From the point of view of the γ-ray physicists, however, many 
questions of detail remain to be cleared up, and some time was spent on these. 
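
To give a feeling for the energy resolution mentioned above, the following sketch converts a fractional energy change into an absolute shift and into the relative velocity whose first-order Doppler shift would equal it. It is illustrative only: the fractional resolution is taken as 1 part in 10¹², and the 14.4 kev γ-ray energy is an assumed example not named in the report.

```python
C_LIGHT = 2.998e8  # velocity of light, m/s


def doppler_velocity(fractional_shift):
    """Relative velocity (m/s) giving a first-order Doppler shift dE/E."""
    return C_LIGHT * fractional_shift


def energy_shift_ev(gamma_energy_ev, fractional_shift):
    """Absolute energy change (ev) for a given fractional change."""
    return gamma_energy_ev * fractional_shift


FRACTION = 1.0e-12       # fractional energy change, as read from the text
GAMMA_ENERGY = 14.4e3    # ev; assumed example energy, not from the report

print(f"energy shift: {energy_shift_ev(GAMMA_ENERGY, FRACTION):.2e} ev")
print(f"equivalent velocity: {doppler_velocity(FRACTION) * 1e3:.2f} mm/s")
```

The equivalent velocity comes out as a fraction of a millimetre per second, which indicates how fine an energy scale is being probed.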

Nuclear physics is concerned with two rather different and largely separate 
problems. One is the elucidation of the forces that act between two nucleons. 
The other is the problem of nuclear structure and dynamics; even if the forces 
were exactly known, nuclei with more than two nucleons have too many degrees 
of freedom to permit an exact solution of the equation of motion. It is therefore 
necessary to make approximate models which emphasize particular aspects of 
the problem. A substantial fraction of the papers in the session on low energy 
nuclear physics dealt with a comparison between such calculations and experi- 
mental results. An encouraging feature that emerged was the extent to which 
nuclear level schemes from the currently accepted models agree with experiment. 




Further, it was clear that considerable progress has been made recently in 
understanding the apparent inconsistency of various models from a theoretical 
point of view, so that, for instance, the collective motion of nuclei is now well 
understood from a single particle point of view. 

The basic interaction between nucleons was considered in separate sessions. 
To study the details of the nuclear force, shorter wavelengths and thus higher 
energies are needed. It is obviously advantageous to study the interactions of the 
simplest system—just two nucleons—and to avoid the complication of a many 
particle system. Our present knowledge of the scattering of two nucleons up 
to a kinetic energy of 1000 Mev was outlined and the further experiments necessary 
to determine all the phenomenological parameters were discussed. Rather 
sophisticated techniques involving several successive scatterings or the use of 
polarized targets as well as polarized beams were envisaged. It was made clear 
that the outer part of the potential is now well understood in terms of interac- 
tion via the meson field, and accurate experiments were described from which the 
strength of the coupling between the nucleons and the meson field can be de- 
duced. This value agrees with that obtained from other experiments, for example 
the production of mesons. It is clear from experiments like these last, which 
were not within the range of principal interest of this conference, that the 
nucleon itself is not a simple object, and that experiments at still higher energies 
will be needed to help us fully understand its properties and those of other 
elementary particles. J. B. Adams, in his Duddell Medal address, described 
the design and operation of the CERN proton synchrotron, which is now 
producing intense beams of protons of 28 Bev and of π-mesons of 16 Bev as 
well as of other particles. Some experiments were reported suggesting that 
at very high energies some simple features again appear in the problem. 
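
The connection between energy and wavelength noted at the start of this paragraph can be made quantitative. The sketch below computes the reduced de Broglie wavelength of a proton from its kinetic energy, using the relativistic momentum in the laboratory frame; the choice of the laboratory rather than the centre-of-mass frame, and the particular energies tabulated, are assumptions made here for illustration.

```python
import math

HBAR_C = 197.327       # Mev fm
PROTON_MASS = 938.272  # Mev (rest energy)


def reduced_wavelength_fm(kinetic_energy_mev, mass_mev=PROTON_MASS):
    """Reduced de Broglie wavelength (fm) from the relativistic momentum."""
    pc = math.sqrt(kinetic_energy_mev ** 2 + 2.0 * kinetic_energy_mev * mass_mev)
    return HBAR_C / pc


for energy in (10.0, 100.0, 1000.0):
    print(f"{energy:6.0f} Mev: {reduced_wavelength_fm(energy):.3f} fm")
```

The wavelength falls from rather more than 1 fm at 10 Mev to roughly a tenth of that at 1000 Mev, which is why the finer details of the force call for the higher energies.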

Future possibilities for particle accelerators provided much discussion at the 
Conference. Adams referred to two possible developments in high energy 
accelerators; firstly the ‘cascade synchrotron’, suggested by Matthew Sands 
for the 300 Bev region, in which the beam is transferred to a separate accelerator 
at about 10 Bev; the other suggestion was the development of ‘storage rings’ 
for the particles from the present CERN machine. Each scheme has its 
advantages and it is not unlikely that both will ultimately be carried out. At 
lower energies the present interest in double and triple scattering experiments is 
reflected in a demand for high beam intensities. Various suggestions were 
outlined including the interesting proposal of a superconducting liner for a 
linear accelerator to reduce the R.F. losses (which take most of the power), and 
therefore to permit much higher intensities with the same R.F. power. The 
problem of refrigeration was considered and an estimate of £80 000 for a 50 Mev 
machine was mentioned. On other experimental techniques, most of the papers 
on instrumentation described more or less conventional particle detectors; some 
progress in the development of solid state detectors and automatic scanning 
devices was also reported. A. W. Merrison in his C. V. Boys Prize address 
discussed the problems of design of counter experiments, particularly those with 
low counting rates. 

The sessions devoted to theoretical techniques were largely concerned with 
collective excitations and their description from a single particle point of view; 
some time was devoted to simple models of reaction mechanisms at low and 
high energies. 


Report on the Fourth International Conference and Exhibition 
on Medical Electronics, New York, 1961 


by A. NIGHTINGALE 
St. Thomas’ Hospital, London 


1. THE CONFERENCE 


Organized by the International Federation for Medical Electronics the 
Conference was held at the Waldorf-Astoria, and was associated with the 14th 
Annual Conference on Electrical Techniques in Medicine and Biology. Of the 
1600 participants 250 came from overseas, and 25 countries were represented. 
300 papers were presented, in four parallel sessions, over a period of five full days, 
covering instrumentation for all fields of medical research and practice, data 
processing, computer diagnosis, theories of neural function and biological effects 
of microwaves. A single observer can do no more than offer some general 
impressions and report a few outstanding papers. 

In his opening address, Dr. Alfred N. Goldsmith (I.R.E. Editor Emeritus), 
defined the field of the Conference. Electronics, he said, had much to offer 
medicine, providing speed of operation, precision, memory storage and access, 
automatic computation and control of processes. At the same time he gave the 
timely warning that machines cannot replace the good judgement of the doctor: 
“electronics provides knowledge, man provides wisdom”. 


Noise 

In medical and biological research the instruments are often pushed towards 
the limits of sensitivity, limits which are set by ‘noise’, which may be extraneous 
interference, instrument noise, or noise arising in the source itself. In a paper 
on X-ray Image Amplifiers, A. E. Anderson and G. W. Goetze (Westinghouse) 
pointed out that it should be possible to make a significant improvement in the 
performance of present systems before the theoretical photon noise, which 
corresponds to the minimum dose-rate to the patient for the extraction of useful 
information, is reached. In the same field of work, E. W. Webster and R. 
Wipfielder (Massachusetts General Hospital, Boston) compared orthodox 
fluoroscopy, optical viewing of an intensifier, and T.V. viewing by Vidicon or 
Image Orthicon. They concluded that in the Vidicon the limit of contrast 
perception was set by non-uniformity of the photoconductor. As expected, the 
Image Orthicon system could obtain the same useful information for a smaller 
patient dose, and in this case (under ‘laboratory’ conditions) the minimum 
perceptible contrast, for a given incident dose-rate of x-rays on the input screen, 
approached the theoretical value. One difficulty in this work is the lack of precise 
information on the photon content of an inhomogeneous beam of x-rays. 
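
The photon-noise limit referred to in this paragraph follows from counting statistics. The sketch below is a simplified illustration, not the analysis of either paper: it assumes Poisson statistics, so that N detected photons fluctuate by about the square root of N, and it assumes a threshold signal-to-noise ratio of 5 for a detail to be perceptible (an often-quoted figure adopted here as an assumption).

```python
import math


def limiting_contrast(photons_per_element, threshold_snr=5.0):
    """Smallest perceptible contrast when photon (Poisson) noise is the only limit.

    A contrast C changes the count by C*N against a fluctuation of sqrt(N),
    so the detail is perceptible when C*sqrt(N) exceeds the threshold.
    """
    return threshold_snr / math.sqrt(photons_per_element)


for photons in (1.0e3, 1.0e4, 1.0e5):
    print(f"{photons:8.0f} photons per element: contrast limit "
          f"{100.0 * limiting_contrast(photons):.1f} per cent")
```

On these assumptions, halving the dose raises the limiting contrast by a factor of the square root of two, which shows why the minimum dose-rate and the useful information extracted are bound together.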

Noise was also referred to by A. Nightingale (St. Thomas’ Hospital, London), 
in a preliminary study of the possibility of using myo-electric signals (picked up 
from the residual active fibres of a paralysed muscle) to control an artificial 
muscle. After reducing extraneous and amplifier noise, the limit in this case 
(and in all other electromyographic investigations) was set by a background of 
electrical fluctuations characteristic of living tissues, whose precise source was 
unknown, but which remained even when the subject was comfortably relaxed. 
The spectrum of this noise was concentrated at the lowest frequencies, so that 
some improvement in signal/noise ratio was achieved by filtering, even though 
this resulted in a lower absolute signal amplitude. 

The fact that one man’s signal is another man’s noise was exemplified by 
papers on the detection of the foetal electrocardiogram, a technique which is 
already providing the obstetrician with valuable information on the condition 
of the unborn infant. The mother is, of course, a source of interference in this 
procedure. In his concise and informative review of this field S. D. Larks 
(Marquette University, Milwaukee), indicated that in experienced hands correct 
placement of surface electrodes can provide usable records, but that improve- 
ments were desirable. Experiments with intra-uterine electrodes were reported 
by C. Sureau and R. Trocellier (University of Paris). These workers were also 
studying the possibility of cancellation of the maternal signal by the addition 
of opposite maternal potentials by means of a potentiometer. 

A more sophisticated approach to the noise problem is to use computers to 
obtain auto-correlograms. One example of this method, also in the field of 
foetal electrocardiography, was reported by A. G. Favret (Department of the 
Army, Washington). The autocorrelation, φ(x), for a time varying signal of 
sufficient length can be obtained approximately by 

$$\phi(x) \approx \sum_{M=0}^{N-x} Y_M \, Y_{M+x}$$

in which N is the total number of equally spaced discrete points considered 
along a curve representing the signal, and the Y’s represent the values of the 
amplitude at two points separated by a distance of x units. In his experiments 
Dr. Favret took nearly 1000 samples from a recording in which the signal was 
invisible in a background of random noise, transferred the data to punched cards, 
and fed it into an IBM—704 digital computer. The output was in the form of 
a plot of autocorrelation coefficients for different values of x (in this case corres- 
ponding to time intervals). A peak in this curve corresponded to the foetal 
heart rate. 
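
The procedure described above can be imitated on a small scale. The following sketch is not Dr. Favret's programme: the record is synthetic, the pulse period, pulse amplitude and noise level are hypothetical, the product sum is left unnormalized, and the samples are indexed from zero so that the sum runs over every pair of points x units apart.

```python
import random


def autocorrelation(samples, lag):
    """Sum of products Y_M * Y_(M+lag) over every valid M (unnormalized)."""
    n = len(samples)
    return sum(samples[m] * samples[m + lag] for m in range(n - lag))


def make_record(n=1000, period=25, amplitude=1.5, noise_sigma=0.5, seed=0):
    """Synthetic record: one pulse every `period` samples plus Gaussian noise."""
    rng = random.Random(seed)
    return [(amplitude if m % period == 0 else 0.0) + rng.gauss(0.0, noise_sigma)
            for m in range(n)]


record = make_record()

# Search a limited range of lags so that only the fundamental period is included.
phi = {lag: autocorrelation(record, lag) for lag in range(1, 41)}
best_lag = max(phi, key=phi.get)
print("lag with the largest autocorrelation:", best_lag)
```

The lag at which the sum is largest corresponds to the repetition interval of the pulses, and hence to the rate, just as the peak in the plotted curve corresponded to the foetal heart rate.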

This complicated procedure shows the rather formidable nature of many 
attempts to use what should be time-saving data processing techniques. How- 
ever, as Dr. Favret himself pointed out, small special purpose computers are 
being developed which can be harnessed to the equipment to provide a solution 
within a few seconds. An outstanding example of a limited-purpose (but 
nevertheless versatile) assembly of computer components was that of J. S. Barlow 
(M.I.T.). His equipment was designed for neurophysiological data analysis, and 
provided facilities for obtaining such parameters as the auto- and cross-correlation, 
the amplitude histogram, the interval histogram for base-line crossing, and the 
reaction time histogram. 


Radio Pills 


Last year at the London Conference radio pills for telemetering information 
from the gastro-intestinal tract were described. This year progress was reported 
in the design of transducers which make these pills even more versatile. For 
example, H. S. Wolff (Medical Research Council, London), discussed a 
temperature-sensitive device. He used a recently developed nickel-iron alloy, 
whose magnetic permeability has a high temperature coefficient over the re- 
quired range, as the core of an inductance in a tuned circuit. This controlled 
the frequency in the FM transmitter (already used in his pressure-sensitive pills). 
A deviation of 40 kc/s in a 400 kc/s carrier was achieved for a 4°C change in 
temperature. H. G. Noeller (Heidelberg) described a transducer consisting of 
platinum and silver electrodes built into a pill for measuring oxygen concentra- 
tion. Apart from these continuing efforts to extend the range of measurable 
parameters, the most important outstanding problem in this field is the locali- 
zation of the pill inside the body. B. Jacobson (Karolinska Institute, Stockholm) 
described an ingenious antenna servo system for ‘tracking’ a pill. This 
consisted of a sheet of paper supported on a curved Perspex plate placed over the 
abdomen of the patient. A pen attached to the antenna carriage drew a map 
of the movements of the pill. The small loop antenna was rotated at 60 r.p.m. 
in a circular path parallel to the abdominal surface, and was driven by servo 
motors so that it sought a position of constant signal strength during the rotation, 
that is, a position in which the pill lay on its axis of rotation. The signal strength
gave some indication of the depth of the pill below the surface of the abdomen, 
although this information was not yet precise. The use of telemetering for 
measuring heart rate and respiration rate from athletes was described by V. V. 
Rosenblat (U.S.S.R.). The transducer and transmitter carried by the athlete 
weighed only 100 gm. “Dynamic telemetry”, he said, “has ushered in an
important phase of sports medicine.”
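
The figures quoted for Wolff's temperature-sensitive pill can be turned into a rough sensitivity estimate. The sketch below assumes an ideal LC tuned circuit with a fixed capacitor, so that frequency varies as 1/sqrt(L), and assumes (arbitrarily) that the 40 kc/s deviation is upward from the 400 kc/s carrier. The function names and the direction of the shift are illustrative assumptions, not details given in the report.

```python
import math

def resonant_frequency(inductance_h, capacitance_f):
    """Resonant frequency of an ideal LC circuit, f = 1 / (2*pi*sqrt(L*C))."""
    return 1.0 / (2.0 * math.pi * math.sqrt(inductance_h * capacitance_f))

def inductance_ratio_for_shift(f_start_hz, f_end_hz):
    """Ratio L_end / L_start needed to move the resonance, capacitor fixed."""
    # f is proportional to 1/sqrt(L), hence L_end/L_start = (f_start/f_end)^2
    return (f_start_hz / f_end_hz) ** 2

CARRIER_HZ = 400e3     # 400 kc/s carrier, from the report
DEVIATION_HZ = 40e3    # 40 kc/s deviation, from the report
TEMP_SPAN_C = 4.0      # temperature change producing that deviation

# Mean sensitivity over the 4 degree span: 10 kc/s per degree
sensitivity_hz_per_degc = DEVIATION_HZ / TEMP_SPAN_C

# Assumed upward shift: the inductance must fall to about 83% of its value
ratio = inductance_ratio_for_shift(CARRIER_HZ, CARRIER_HZ + DEVIATION_HZ)

print(f"sensitivity: {sensitivity_hz_per_degc / 1e3:.0f} kc/s per degree C")
print(f"inductance ratio for the full deviation: {ratio:.3f}")
```

On these assumptions a 10 per cent change in frequency calls for a change of roughly 17 per cent in the inductance, which shows why a core material with a high temperature coefficient of permeability is needed.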

The founder-president of the International Federation for Medical Electronics, 
Dr. V. K. Zworykin, in his address at the Conference Banquet, announced the 
establishment of an official publication of the International Federation. This 
he said would aim at a high scientific standard and provide facilities for the 
exchange of knowledge between scientists all over the world. The journal 
would be published by Pergamon Press and would be called Medical Electronics 
and Biological Engineering. Dr. A. Nightingale (St. Thomas’ Hospital, 
London) had been appointed Editor. The first issue would appear in a few
months' time.


2. THE EXHIBITION

One’s first impression of the exhibition was the tremendous range of ‘black
boxes’ capable of detecting, amplifying, analysing, mathematically processing,
analogizing, digitalizing, memorizing, and displaying almost any parameter of a 
biological system. 

Development is proceeding very rapidly ; commercial firms are applying well 
established data processing instruments already in use in industrial and military 
fields, and are also developing small special-purpose instruments. Some 60 
firms were represented, including a strong contingent from Japan. It was 
disappointing to find no British firms present, but Britain had provided an 
excellent show at Olympia last year. However, a number of British instruments 
were shown by their American agents, and in the scientific section of the exhi- 
bition, which was sadly unsupported by Americans, five out of the nine exhibits 
came from Britain. One stand was taken by the Biological Engineering Society ; 
this provided space for four exhibits by members of the Society, and attracted 
much interest. On an adjoining stand Wg.-Cdr. J. E. Malcolm (R.A.F. and
Postgraduate Medical School) exhibited his working hydraulic model of the
heart and coronary arterial system. Another outstanding scientific exhibit was 
shown by M. P. Spencer (Bowman Gray School of Medicine, Winston-Salem, 
N. Carolina). This was a display of techniques used to obtain electrical analogue
models of the circulatory system. The starting point in this work was a range of 
electromagnetic flowmeters, applicable to any unopened but surgically exposed 
vessel. Accurate waveforms obtained with these transducers on both animal 
and human subjects had been matched by waveforms generated by electrical 
models. Passive elements had been used initially, and more recently active 
elements which made the model more versatile, with easier control of coefficients. 

Returning to the commercial side of the exhibition, it is only possible to 
mention a few specific examples of the instruments on show, without implying 
that those selected are necessarily the best of their kind. 


The CAT (Mnemotron Corporation). 

This is a Computer of Average Transients, and is a small digital computer (no 
larger than a typical CRO) designed to extract a response, following a stimulus, 
from the noise background, even when the noise is many times larger than the 
response itself. This is an important problem in physiological investigations, 
e.g. in the measurement of neuro-electric signals associated with the stretch 
reflex in human subjects. The principle is that of averaging, in which the 
response signals following successive stimuli add linearly while the random noise 
adds up according to the square root of the sum of the squares. The averages are 
calculated for 400 ordinates spaced at chosen intervals. A ferrite-core memory
is employed. The stimulus is repeated as often as required, and the data 
scanned during the experiment and displayed on the built-in CRO, until the 
desired signal to noise ratio is achieved. Four different variables can be 
handled simultaneously. 
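
The averaging principle described above can be illustrated numerically. The following is a minimal sketch, not a model of the Mnemotron instrument: the response shape, the noise level, and all names are invented for illustration, and Gaussian noise is assumed. Only the 400 ordinates are taken from the description.

```python
import math
import random

def average_sweeps(response, noise_rms, n_sweeps, rng):
    """Average n_sweeps noisy repetitions of the same response, ordinate by ordinate."""
    total = [0.0] * len(response)
    for _ in range(n_sweeps):
        for i, value in enumerate(response):
            # Each sweep is the fixed response plus fresh random noise
            total[i] += value + rng.gauss(0.0, noise_rms)
    return [t / n_sweeps for t in total]

def rms_error(estimate, truth):
    """Root-mean-square difference between the averaged trace and the true response."""
    return math.sqrt(sum((e - t) ** 2 for e, t in zip(estimate, truth)) / len(truth))

rng = random.Random(1961)          # fixed seed so the run is repeatable
ORDINATES = 400                    # number of ordinates, as in the description
NOISE_RMS = 5.0                    # noise five times the peak response (assumed)

# Hypothetical response: a smooth bump of unit height following the stimulus
response = [math.exp(-((i - 100) / 20.0) ** 2) for i in range(ORDINATES)]

err_1 = rms_error(average_sweeps(response, NOISE_RMS, 1, rng), response)
err_400 = rms_error(average_sweeps(response, NOISE_RMS, 400, rng), response)

print(f"residual noise after 1 sweep:    {err_1:.2f}")
print(f"residual noise after 400 sweeps: {err_400:.2f}")
print(f"improvement factor:              {err_1 / err_400:.1f}")
```

Because the response adds linearly while the noise grows only as the square root of the number of sweeps, the improvement factor after 400 sweeps should be close to sqrt(400) = 20.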


Infra-red microscope (Nippon Electric Co. Ltd., Japan) 

This instrument makes use of an infra-red image tube, with a light source 
with infra-red filter, and a conventional microscope. The wavelength of greatest
sensitivity is 1·3 µ. The unit has been used for solid state studies, e.g. examining
silicon crystals for stresses and impurities, and promises to be a useful sup- 
plementary tool in biology. 


Flash X-ray systems (Field Emission Corporation) 

This company showed a range of flash x-ray tubes with associated power 
supplies, using the principle of field emission in which electrons are drawn from 
a sharp tungsten needle (unheated) by a high electric field. The electric field
is pulsed and the current during the pulse (a fraction of a µs in duration) is
of the order of 1000 amperes. In one unit, which has obvious possibilities for
diagnostic radiology and for radiation biology, the voltage is 100 kV, the peak
power 140 MW, the peak dose rate 10% r/sec, and the effective source size is
much smaller than in conventional tubes. The tube would of course require a
suitable mounting for diagnostic work, but it is so small and light that this could 
be built quite easily in a laboratory workshop. 
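
As a consistency check on the figures quoted for the flash x-ray unit, the peak current follows from the peak power and voltage. The sketch below reads the quoted values as 100 kilovolts and 140 megawatts. The pulse length used for the energy estimate is a purely hypothetical value, since the report says only that the pulse lasts a fraction of a microsecond.

```python
def peak_current_a(peak_power_w, peak_voltage_v):
    """Peak tube current from P = V * I."""
    return peak_power_w / peak_voltage_v

def energy_per_pulse_j(peak_power_w, pulse_length_s):
    """Upper estimate of pulse energy, treating the pulse as rectangular."""
    return peak_power_w * pulse_length_s

PEAK_VOLTAGE_V = 100e3            # 100 kV, from the report
PEAK_POWER_W = 140e6              # 140 MW, from the report
ASSUMED_PULSE_LENGTH_S = 0.1e-6   # hypothetical: 0.1 microsecond, not in the report

current = peak_current_a(PEAK_POWER_W, PEAK_VOLTAGE_V)
energy = energy_per_pulse_j(PEAK_POWER_W, ASSUMED_PULSE_LENGTH_S)

print(f"peak current: {current:.0f} A")
print(f"energy per pulse (assumed pulse length): {energy:.0f} J")
```

The resulting current of 1400 amperes agrees with the statement that the pulse current is of the order of 1000 amperes.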


R.F. pacemaker (Electro-Age Corporation) 
This was developed in collaboration with the Rockefeller Institute and the 
Yale University School of Medicine. The device is designed to provide pulses
of suitable duration (2 msec) and repetition rate (40 to 130 per min), to keep the
heart beating when its own ‘natural pacemaker’ is damaged. A receiving coil 
is implanted below the skin and subcutaneous tissue in the middle of the lower 
sternum. Energy at 2 Mc/s is sent from the transmitting coil attached to the 
surface of the body. The complete transmitter is contained in a small box and 
is battery-operated. The receiving coil is tuned by a ceramic capacitor and 
incorporates a silicon diode, so that the R.F. is half-wave rectified. Two flexible
leads run to electrodes sewn at operation in the wall of the ventricle. The
implanted unit is encapsulated in Silastic rubber (Dow Corning Co.), a material
which appears to be very inert in human tissues and can be autoclaved. This
system has some advantages over completely implantable pacemakers (carrying
their own batteries), since the stimulus can be fully controlled in amplitude and 
repetition rate. 


Multi-electrodes for electromyography (DISA Elektronik, Denmark) 

In addition to a number of DISA electromyographs and stimulators, an 
interesting range of multi-electrode needle electrodes was shown, including a 
bipolar type (two platinum leads running down a stainless steel cannula 0·6 mm
in diameter), used widely for clinical electromyography, and several multi-
electrode types. One of these is a needle, only about 1 mm in diameter, carrying
fourteen 0·1 × 1 mm electrodes spaced over a total length of 20 mm. The skill
involved in the assembly of these units, containing 14 insulated wires running 
within the cannula, is remarkable. They were developed in Professor Buchthal’s 
laboratory in Copenhagen and are used in neurophysiological research to study 
the spatial distribution of electrical activity from muscle fibres in human and 
animal muscles. 


Automated zone comparator (Technical Controls, Inc.)

This instrument automatically measures the diameters of zones of inhibition 
produced in microbiological assays of antibiotics. Six dishes containing the 
cultures are mounted on a revolving plate. Each dish is brought in turn into 
the reading position, in which it is illuminated so that an image of its zone of 
inhibition is received on two photoelectric tubes. These are driven by servos 
so that they seek the outer edges of the zone. When these have been located, 
the zone diameter is encoded on punched cards or tape. The plate then rotates
to bring the next dish into position. 


The Rutherford High Energy Laboratory 
of the National Institute for Research 
in Nuclear Science 


SUMMARY 


The National Institute for Research in Nuclear Science was constituted 
in March 1957 as a consequence of a Government decision to set up an 
establishment which would have as its main object the provision, for 
common use by universities and others, of facilities and equipment which are 
beyond the scope of individual universities and institutions carrying out 
research in the nuclear field. The Institute, which was given its Royal 
Charter in 1958, has on it representatives of the universities, the University 
Grants Committee, the Royal Society, the UKAEA, and the Department of 
Scientific and Industrial Research. The Institute’s first laboratory, the 
Rutherford High Energy Laboratory, has been established on a site 
adjacent to the AERE. The Director is Dr. T. G. Pickavance. The 
principal machines of the Laboratory are the 50 MeV proton linear accelerator, 
which was handed over to the Institute by the UKAEA in 1959, and the 
7 GeV proton synchrotron, now under construction by the Authority for the 
Institute, to which the name NIMROD has been given. (G stands for
“giga”, that is, 10⁹.)

The general policy is that the Institute shall operate these machines for 
visiting teams of research workers from the universities, but the Institute’s 
staff will include some nuclear-physics research workers on fixed-term 
contracts. Originally it was intended that most of the operation, main- 
tenance, and machine development would be done by the Authority as a 
service; however it now appears that the size of staff required for NIMROD, 
when it comes into operation in 1962, will be such that these functions could 
be better undertaken by the Institute itself. 


1. THE PROTON LINEAR ACCELERATOR


The first accelerator of the Rutherford High Energy Laboratory to become 
operational was the proton linear accelerator, designed and built for the 
Institute by the UKAEA. The machine consists of three cylindrical resonant 
cavities in which pulses of protons, injected at 500 keV from a Cockcroft- 
Walton generator, are accelerated successively to energies of 10, 30, and 
50 MeV. Each resonant structure is contained in an evacuated tank, the total 
length of the three tanks being some 100 ft. Emergent pulses of protons pass 
through a shielding concrete wall into the experimental area, where a bending 
magnet will be installed to deflect the beam in any one of five directions, to five 
different experimental assemblies. It is expected that in full operation the mean 
proton current will exceed 5 µA (3 × 10¹³ protons sec⁻¹) whilst during the
pulse the peak value will be a hundred times this; the pulse length is 200 µs
and the repetition rate 50 pulses a second.
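
The beam figures for the linear accelerator are mutually consistent, as a short calculation shows. The sketch below takes the mean current as 5 µA, the pulse length as 200 µs and the repetition rate as 50 pulses a second. The variable names are illustrative, and rectangular pulses are assumed.

```python
ELEMENTARY_CHARGE_C = 1.602176634e-19   # charge of one proton, in coulombs

mean_current_a = 5e-6        # 5 microamperes mean proton current
pulse_length_s = 200e-6      # 200 microsecond pulse
repetition_rate_hz = 50.0    # 50 pulses a second

# Mean number of protons delivered per second
protons_per_second = mean_current_a / ELEMENTARY_CHARGE_C

# Fraction of the time the beam is on (rectangular pulses assumed)
duty_factor = pulse_length_s * repetition_rate_hz

# Current during the pulse, and protons carried by each pulse
peak_current_a = mean_current_a / duty_factor
protons_per_pulse = protons_per_second / repetition_rate_hz

print(f"protons per second: {protons_per_second:.2e}")
print(f"duty factor:        {duty_factor:.3f}")
print(f"peak / mean ratio:  {peak_current_a / mean_current_a:.0f}")
print(f"protons per pulse:  {protons_per_pulse:.2e}")
```

The duty factor of one per cent accounts for the peak current being a hundred times the mean, and 5 µA corresponds to about 3 × 10¹³ protons a second.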


2. NIMROD 


The name NIMROD (“A mighty one in the earth”, Genesis 10, 8-12)
has been given to the 7 GeV proton synchrotron which is being built 
under the direction of the UKAEA for the National Institute for Research in 
Nuclear Science. The machine consists of a ring-shaped magnet 150 ft in
diameter, weighing 7000 tons, between whose poles is placed a toroidal evacuated
chamber; a pulse of protons, given an initial acceleration to 15 MeV in a linear 
accelerator, is injected into this chamber and the protons are forced by the 
magnetic field into a circular path, in which they receive an acceleration from a 
radio-frequency electric field once in each revolution. When the pulse has 
reached its maximum energy it is extracted from the vacuum chamber and 
allowed to pass into experimental apparatus. During the acceleration the 
magnetic field strength and the frequency of the electric accelerating field have 
both to be increased steadily and rapidly to confine the proton orbits to the 
magnet ring, and in such a manner as to maintain the delicately balanced 
stability in the motion of the protons. The whole machine is housed in a semi-
underground circular building of reinforced concrete 200 ft in diameter, with
a 6 ft concrete roof upon which a 20 ft layer of earth is placed for additional
shielding. 
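
The need to raise both the accelerating frequency and the magnetic field during the cycle can be made quantitative with elementary relativistic kinematics. The sketch below is only indicative: it assumes a circular orbit whose radius is half the quoted 150 ft magnet diameter, ignores any straight sections, and assumes one accelerating cycle per revolution. The function names are illustrative.

```python
import math

PROTON_REST_ENERGY_MEV = 938.272
SPEED_OF_LIGHT_M_S = 299_792_458.0
FEET_TO_METRES = 0.3048

def gamma(kinetic_energy_mev):
    """Lorentz factor of a proton with the given kinetic energy."""
    return 1.0 + kinetic_energy_mev / PROTON_REST_ENERGY_MEV

def beta(kinetic_energy_mev):
    """Proton speed as a fraction of the speed of light."""
    return math.sqrt(1.0 - 1.0 / gamma(kinetic_energy_mev) ** 2)

def revolution_frequency_hz(kinetic_energy_mev, orbit_radius_m):
    """Revolutions per second on a circular orbit of fixed radius."""
    return beta(kinetic_energy_mev) * SPEED_OF_LIGHT_M_S / (2.0 * math.pi * orbit_radius_m)

def relative_momentum(kinetic_energy_mev):
    """Momentum in units of (rest mass x c); the guide field scales with this."""
    return beta(kinetic_energy_mev) * gamma(kinetic_energy_mev)

# Assumption: orbit radius taken as half the 150 ft magnet diameter
ORBIT_RADIUS_M = 0.5 * 150.0 * FEET_TO_METRES

f_injection = revolution_frequency_hz(15.0, ORBIT_RADIUS_M)    # at 15 MeV injection
f_final = revolution_frequency_hz(7000.0, ORBIT_RADIUS_M)      # at 7 GeV
frequency_swing = f_final / f_injection
field_swing = relative_momentum(7000.0) / relative_momentum(15.0)

print(f"revolution frequency at injection: {f_injection / 1e3:.0f} kc/s")
print(f"revolution frequency at 7 GeV:     {f_final / 1e6:.2f} Mc/s")
print(f"frequency must rise by a factor of about {frequency_swing:.1f}")
print(f"field must rise by a factor of about {field_swing:.0f}")
```

On these assumptions the frequency rises by a factor of between five and six, while the field, which must follow the proton momentum, rises by a factor of nearly fifty.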

The machine is designed to produce 10¹² protons per pulse at a repetition
rate of 28 pulses a minute; this is equivalent to about 1/16 of a µA. NIMROD
should be completed during 1962. 
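
The conversion from protons per pulse to mean current can be checked in a few lines. The sketch below takes the round figures of 10¹² protons per pulse and 28 pulses a minute at face value. The names are illustrative.

```python
ELEMENTARY_CHARGE_C = 1.602176634e-19   # charge of one proton, in coulombs

protons_per_pulse = 1e12     # design figure, taken as a round number
pulses_per_minute = 28.0     # design repetition rate

# Charge delivered per second is the mean current
protons_per_second = protons_per_pulse * pulses_per_minute / 60.0
mean_current_ua = protons_per_second * ELEMENTARY_CHARGE_C * 1e6

# Protons per pulse that would give exactly 1/16 of a microampere
protons_for_sixteenth = (1e-6 / 16.0) / ELEMENTARY_CHARGE_C * 60.0 / pulses_per_minute

print(f"mean current: {mean_current_ua:.3f} microamperes")
print(f"protons per pulse for 1/16 microampere: {protons_for_sixteenth:.1e}")
```

With these round inputs the result is a little over 0.07 µA, of the same order as the quoted 1/16 of a µA. The quoted value corresponds to rather less than 10¹² protons per pulse, so the two figures agree only to the precision implied by 'about'.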


3. HISTORICAL BACKGROUND


Particle accelerators are of great importance in basic nuclear physics 
research; they are also used by the Authority to obtain nuclear, chemical and 
metallurgical data for the Authority’s reactor development programme. Until 
January 1961, the Atomic Energy Research Establishment, Harwell, carried 
out a considerable programme of research and development on accelerator 
design and built a variety of accelerators. Most of those engaged in accelerator 
development have now been transferred to the Rutherford Laboratory of the 
National Institute. 

Shortly after the war the AERE set up four accelerator groups which 
included many staff who had hitherto been engaged in the development of radar 
equipment. Their specialist interests were electrostatic generators; cyclotrons;
electron synchrotrons; and electron linear accelerators. For some years the 
last two groups were located outside the Telecommunications Research 
Establishment (now the Royal Radar Establishment) in Malvern. By 1949 a 
5 MeV Van de Graaff machine and the 180 MeV synchrocyclotron were operating 
at AERE, and electron synchrotrons and electron linear accelerators had been 
built for nuclear physics and medical purposes. Also, help was given in the 
development of such machines as the 300 MeV electron synchrotron at Glasgow 
University and the 10 MeV electron linear accelerator at Hammersmith Hospital. 

By 1950 the AERE accelerator effort had been concentrated into the General 
Physics Division under the late Professor H. W. B. Skinner and subsequently 
under Mr. D. W. Fry (now Director of the Atomic Energy Establishment, 
Winfrith). Three electron linear accelerators were developed and built for 
pulsed neutron studies, two tandem electrostatic generators were developed 
for AERE and AWRE in collaboration with industry, and effort on high energy 
accelerating machines for nuclear physics was directed towards a proton linear 
accelerator. Several universities were also engaged in accelerator development 
and machines were commissioned at Birmingham (1 GeV proton synchrotron, 
1953) and at Liverpool (450 MeV synchrocyclotron, 1954). 


[Figure: scheme of the proton linear accelerator. Legible labels include the injector room, counting room, concrete shielding wall and earth bank shielding.]



As early as 1949 it was recognized that advanced equipment for high energy 
nuclear physics could be a fruitful field for European collaboration. A ‘ Euro- 
pean Council for Nuclear Research’ was set up as an interim organization in 
February 1952 and a Convention establishing the permanent ‘ European 
Organization for Nuclear Research’ (CERN) was signed in July 1953 by the 
United Kingdom and eleven other European countries. Two large accelerator 
projects had already been proposed in 1951, a 600 MeV synchrocyclotron and a 
large proton synchrotron. Accelerator physicists from U.K. universities and 
from AERE played a large part in the CERN co-operative venture. Work 
began on a site just outside Geneva (at Meyrin) in 1954 and the synchrocyclotron 
produced its first beam in August 1957. The alternating-gradient proton 
synchrotron achieved an energy of 24 GeV in September 1959 and 28 GeV in 
December of the same year. 


[Figure: the 7 GeV proton synchrotron.] NIMROD. It really is “in the earth”, in that the whole machine
and experimental area are covered by a large artificial mound.


During 1953 the AERE accelerator development group, in co-operation 
with the universities, were designing a 600 MeV proton linear accelerator. 
However, by 1955, improvements in the extracted proton beam from the 
Liverpool cyclotron and the discovery that the ‘strange particles’ of the
cosmic rays (kaons, and Λ, Σ, and Ξ hyperons) could be produced by the
Cosmotron (3 GeV) and the Bevatron (6 GeV) in the United States made it
desirable to stop at the 50 MeV stage and to transfer money and effort to the
development of a 7 GeV constant gradient synchrotron with as high a yield of
protons as possible. This project was intended for pure nuclear physics 
research with the universities playing a major role, so the National Institute for 
Research in Nuclear Science was evolved in 1956. 

The 7 GeV proton synchrotron (NIMROD) was the first National Institute 
project. An ample site outside the main area of AERE, already accommodating 
the buildings associated with the proton linear accelerator, was offered by 
the Atomic Energy Authority. It was accepted by the Institute and named 
‘The Rutherford High Energy Laboratory’. The Authority were, initially, 
responsible for design and construction of NIMROD and provided staff and 
services to the Institute. The proton linear accelerator was still under construc- 
tion and was not offered at this time. 

The general development of the Laboratory and the construction of 
NIMROD have gone forward rapidly. The proton linear accelerator was trans- 
ferred to Institute ownership after it was commissioned; it is now being used 
for nuclear physics research by the universities. 


ACKNOWLEDGEMENT


Grateful acknowledgement is made to the Rutherford High Energy 
Laboratory for the information on which this article is based, and also for 
the two copyright illustrations. 


ESSAY REVIEW 


Reports on Progress in Physics, Vol. 24. (London: The Institute of Physics and the 
Physical Society, 1961.) [Pp. 424.] 


This collection of eight review articles is the latest volume in the well known series. 
One cannot help wondering whether any but editors and reviewers are likely to read the 
whole volume; a survey of the reading habits of physicists would provide some interesting 
results. It is commendable therefore that the separate articles are available in paper covers 
at a reasonable price, and in fact the complete collection can be obtained in this way more 
cheaply than the bound volume. 

Three of the eight papers are almost completely theoretical, and all three are really 
concerned with many-body problems. The first of these, on “The Theory of the Superconductive
State” by H. Fröhlich (23 pages), is also the shortest of the collection. In spite
of this it reaches back to the beginnings of the emergence of such theories and traces the 
development to date, with sufficient mathematical quotation to be definite, but without 
long or detailed calculations. The author himself explains that he is trying to give a physical 
picture of the process without the use of advanced mathematical language, and so far as 
recent work resists this process he feels that it is not yet clearly understood. In a rather 
unusual paragraph near the end, the author considers what he calls “the psychology of
superconductivity”, although the issues he raises are wider than this title implies.

To the present reviewer the arguments he advances seem somewhat conflicting. First 
he mentions that editors of scientific journals are careful to eliminate all reference to human 
emotions, thus usually making the papers dull, and he advocates the establishment of a new 
journal for discussion and criticism in which, presumably, scientists would drop their 
masks of objectivity. Yet later he criticizes recent work for “concern with keeping an
upper hand in discussion rather than with illuminating the subject”. These are matters
which clearly could be debated at some length (and would themselves probably generate 
considerable emotion!). The present writer is conservative in outlook on such matters 
and thinks the convention of objectivity a good one, in spite of its possible vulnerability 
to charges of near-hypocrisy. A poker face can often be of benefit to all. 

The second theoretical paper, on “Theory and Applications of the Density Matrix” by
D. ter Haar (59 pages), is of a much more general character and is concerned with the 
application of two mathematical techniques, the density matrix and Green functions, to an 
astonishingly wide variety of problems. It is a source of wonder and enlightenment that 
a mathematical technique devised for some particular purpose can pervade and to some extent 
unify the most diverse branches of theoretical physics. Perhaps Fourier analysis could be 
cited as a similar example of this process. Yet the very universality of these schemes seems 
to detract from their role of explanation; if mathematics applies to everything, it tells us 
nothing in particular. To appreciate this paper fully one probably needs to be a thorough- 
bred theoretician with considerable mathematical technique and the time and ability to read 
a reasonable fraction of the six pages of references; but even with only a vague idea of what 
is going on behind the symbols, it is possible to follow the arguments presented with much 
interest and even some excitement. If the reader should want to know what the density
matrix is, he had better read the paper, but it is perhaps worth an attempt to help him by
quoting the first of the author’s three short answers: “It is the quantum-mechanical
counterpart of the classical distribution function”.

The third theoretical paper, on “The Dynamics of High Temperature Plasmas” by
W. B. Thompson (62 pages) is concerned with the approach to a theory of the behaviour 
of fully ionized gases, free from inelastic collisions. The stimulus for this recent though 
classical development comes both from astronomy and of course from nuclear fusion research. 
The subject contains much in common with ordinary fluid theory, not only in the mathe- 
matics but in the way that much of it applies to idealized models which can hardly be expected 
to conform very closely to what actually exists. The usefulness of the models, apart from the 
insight they give, remains to be tested by experiment. 

There is a little overlap between the above paper and the paper by A. A. Ware on 
“High Current Gas Discharges” (45 pages), but the emphasis (and some of the notation)
is different, the latter paper being concerned almost entirely with discharges for the study
of fusion, and attempts to stabilize them. This subject is both new and only recently
declassified, so the author has the advantage of being able to start from scratch, and it is 
refreshing to read a good general account of what has been done after seeing so many 
confused scraps of information in newspapers. It seems a nice irony that the word ‘plasma’
means ‘moulded’; this is the very thing we cannot do with it.

The paper on “Cosmic Radio Waves and their Interpretation” by J. L. Pawsey and
E. R. Hill (47 pages), gives a very brief account of the instrumental limitations and pos- 
sibilities, a fairly detailed one of the mechanisms of thermal, synchrotron and line emission, 
and then proceeds to discuss the bearing of results so far obtained on the structure of our 
own and of other galaxies. The paper seems to have been completed just before the recent 
peak in controversy over the distribution of sources and its bearing on the steady state 
universe. 

There are two papers on magnetism. The first, on “Ferrimagnetism”, by W. P. Wolf
(92 pages), is again on a relatively new subject, though it is a property of the oldest known 
magnetic substance, the lodestone. The understanding of such substances has advanced 
enormously in the last decade or so, partly as the result of the technical importance of ferrites, 
but many other ferrimagnetics are now known besides these. Some of these may be more 
useful for basic experimental purposes than the ferrites, for such reasons as available 
purity and ease of production of good crystals, as well as the intrinsic interest of different 
crystal structures. The paper surveys the magnetic, thermal, optical, microwave and 
neutron diffraction experiments which can be performed. The emphasis is on physical
properties rather than applications. 

The other magnetic paper, on “Magnetic Domains” by D. J. Craik and R. S. Tebble,
(51 pages), gives an account of the usual minimum energy theory of domain patterns, 
followed by a survey of experimental techniques used for revealing them, and goes on to 
describe the results obtained on particular materials, with many remarkable illustrations. 
Although many observed domain patterns can be shown, after the event, to be minimum 
energy configurations, we do not have a sufficiently general theory to predict them, and have 
rather little knowledge about how they form in the first place. Some are so intricate that 
one would be tempted to call them ‘ingenious’ if the personification of Nature were not
so strongly forbidden in this century. 

Finally, the paper on “Photoelectronic Image Intensifiers” by J. D. McGee (45 pages)
is the most practical paper of all, and to those of this inclination makes the easiest reading.
The basic idea is clear and simple: for a given number of photons, a photo-cathode will
release far more electrons than a photographic plate will produce developed grains. Since
after acceleration the electrons can be detected individually, the photo-cathode should be
a more sensitive image detecting device than the plate. The way seems wide open, yet
the obstacles to the realization of a satisfactory practical converter are remarkably obstruct- 
ive. Gas from emulsions ruins photo-cathodes, fluorescent screens and optical systems 
may well lose as much light as has been gained, and cascade electron optical systems tend 
to distort the image and destroy its resolution. Nevertheless impressive results have been 
obtained, and the benefits these devices could confer on astronomers, spectroscopists and 
radiologists are clear. 

As a general judgement on the volume, it is lucid, concise, and where the subjects are 
not very new, probably strikes the right balance between developing them from their origins 
and describing only advances in the last few years. But it is painfully clear on scanning 
such a book that in Physics as in the Looking Glass, it takes all the running you can do to
stay in the same place. D. F. GIBBS.


The British Association Meeting at Norwich 
AUGUST 30–SEPTEMBER 6, 1961


The programme of Section A (Physics) of the British Association meeting 
at Norwich was to some extent controlled by the subjects chosen by the President, 
Dr. R. Stoneley, the world authority on Seismology, and the Kelvin Lecturer, 
Professor D. J. E. Ingram of the University College of North Staffordshire. 

Professor Ingram opened the programme with a general introduction to the 
properties of the complete range of electromagnetic radiation, illustrated by 
diagrams and demonstrations. This led to a summary of the historical develop- 
ment of microwave and radiofrequency spectroscopy in recent years, followed 
by descriptions of the applications of microwaves to such things as the measure- 
ment of frequency and time. 

The presentation of this lecture was so delightful that one felt that 60 minutes 
was not enough to contain all that could have been enjoyed by the audience. 

A second paper, by Dr. J. Thomson, the Director of the British Scientific 
Instrument Research Association, dealt with further practical examples of micro- 
wave physics. The application to diagnostic and therapeutic medicine by the 
utilization of the dielectric constant and loss factor of human tissue was of 
particular interest. 

Stimulated emission of radiation for the direct amplification of radio-waves 
was the subject of the final paper on the first day by Dr. O. S. Heavens. Dr. 
Heavens dealt very ably with the ensuing discussion, which was concerned 
chiefly with the reasons for the unidirectional characteristics of the enormously
intense light sources which result when such devices, which are known as optical 
masers, are operated in the visible spectrum. 

The Presidential Address was entitled “The Interior of the Earth” and dealt
with the evidence which has been accumulated to explain the structure of the 
earth in terms of a central core and the state of matter in this core. Other factors 
leading to earth movements were included. 

Dr. I. Maddock followed with a paper dealing with the detection of nuclear 
explosions in space and underground, coloured slides being used to illustrate 
the various effects produced by the explosions. The intense flux of x-rays
produced by the very high temperatures arising in space was of special
significance. The main problem of detection of underground explosions is not so much
to exploit any particular effects due to the explosions but to discriminate between 
these effects and the normal seismic signals due to earthquakes. Some methods 
used for this purpose were referred to. 

The subject of Dr. Maddock’s paper has stimulated the application of 
sensitive electronic methods to the study of seismology itself so that natural 
phenomena can be distinguished from man-made earth shocks. Dr. P. L. 
Willmore described this work and also the speeding up of interpretation of
seismological records by the use of modern computing techniques and improve-
ments in recording apparatus.


The study of storm surges in British waters, described by Mr. G. W. Lennon, 
was particularly appropriate to Norfolk because of the great coastal floods of 
1953. The mechanism of generation of such storm surges has been analysed
and used to formulate a flood-warning system, while the study of abnormal sea 
levels has indicated the necessary height of sea defences at various points on our 
coasts. 

The last time Professor Sir Harrie Massey talked to the British Association 
about theories of the nucleus only six people came to hear him. In 1961 his 
audience was over 300. The present state of nuclear theory was treated in a 
disarmingly clear and simple fashion particularly appropriate to a British 
Association audience, and the regions of “black ignorance” and “reasonable
knowledge” in this field of physics were distinguished.

Dr. W. C. Marshall described the Mössbauer effect in such an enthralling
way that this esoteric phenomenon giving us frequencies defined with very high 
accuracy under certain circumstances was obviously understood and appreciated 
by the same large audience. Dr. Marshall explained why the Doppler effect 
due to the movement of atoms does not ‘blur’ the extreme sharpness of the
‘note’ of γ-rays emitted by some substances, by the analogy of a shunting train,
and then showed how such accurately known frequencies could be used to 
check Einstein’s principle of equivalence. 


Dr. E. G. Michaelis came over from CERN, Geneva, to talk about the present 
state of knowledge of fundamental particles. He described the characteristics 
and inter-relationships of heavy mesons, hyperons and anti-nucleons and dis- 
cussed the impact of this new knowledge on certain basic ideas of physics. 


The combined session with the engineering section held on Tuesday,
September 5th was of great topical interest. The whole morning was devoted
to the consideration of unconventional methods of energy conversion, i.e. the 
production of electrical energy by the direct application of heat without the 
intervention of a steam cycle. Professor T. G. Cowling opened the discussion 
with a consideration of the general physical principles underlying the production 
of energy in nature and dealt in some detail with the real prospects of nuclear 
fission or fusion. 

The Deputy Chairman of the Central Electricity Generating Board, Mr. 
F. H. S. Brown, then considered the practical factors limiting further implemen- 
tation of conventional steam cycles, the most important being the lack of materials 
able to withstand high temperatures and pressures for long periods. Again this 
was a very clear and informed statement adapted very well to the audience. Mr. 
Brown included a warning that alternative methods of energy conversion must 
be economic as well as technically feasible. 


Details of three possible methods were discussed in turn by Professor M. W. 
Thring and Dr. R. G. Siddall†, Mr. P. D. Dunn and Professor D. A. Wright.
Magnetohydrodynamics, or the production of electricity from the kinetic energy
of fast-moving electrically-conducting gases in magnetic fields, was dealt with in
terms of methods of controlling the electrical conductivity of the gases rather
than in terms of the design of an operating system. A working exhibit
illustrating the principle was shown by Dr. Siddall in conjunction with this
paper.

† Combined paper by Professor Thring and Dr. Siddall.


Mr. P. D. Dunn went into the theory of operation of thermionic generators, 
which is based on the Edison effect. Small generators are of immediate interest 
for space vehicles, but it seems possible that high-power devices may eventually 
be developed.

The mechanism of thermoelectric generation, utilizing the Seebeck effect 
with semiconductor materials used for the thermojunctions, was described by 
Professor D. A. Wright. He also discussed research work in progress which is 
aimed at improving present performance and outlined the factors leading to less 
than optimum efficiency. 

The session ended with two papers on Wednesday morning, by Professor 
D. E. Blackwell on “The Zodiacal Light” and by Mr. M. O. Robins on
“British Experiments in Scout Satellites”.

Professor Blackwell described the recent expedition to the Andes to make 
further observations of the zodiacal light, which can best be seen from high 
altitudes in the tropics, and presented evidence relating to two possible causes, 
i.e. scattering of sunlight by electrons or by material situated between the 
planets. He decided in favour of the latter. 

The final paper, by Mr. Robins, was concerned with the design and instru- 
mentation of Scout satellites. It was illustrated by a model showing the 
solar batteries and other features. Methods of overcoming hazards of high 
temperature and mechanical stress were described. 

If attendance and good discussions may be taken as criteria of a successful 
meeting, then the Norwich meeting of Section A was the best in recent years. 


M. V. GRIFFITH. 


Rutherford in Manchester 


by J. E. GEAKE
Manchester College of Science and Technology 


It is now 50 years since Rutherford, working in Manchester, conceived the 
idea that the atom had a small concentrated nucleus, and from this idea sprang 
the whole of our present-day knowledge of atomic structure and our exploitation 
of its consequences. This great landmark in physics was celebrated by holding
the Rutherford International Jubilee Conference early in September. It was 
appropriate that the Conference should be held at Manchester University because, 
although Rutherford did valuable work at Cambridge and at McGill, it was his 
Manchester period which produced the most important results, and the dis- 
coveries with which his name is mainly associated. It was also appropriate 
that there were two parts to the Conference—a commemorative session in which 
some of the surviving members of Rutherford’s Manchester team took us back 
by their reminiscences to those great days of the past, and also a full-scale
conference setting out the present state of our knowledge of the nucleus. To keep
up with a rapidly-changing subject such as this, one must not spend too long 
looking backwards. 

Of those closely associated with Rutherford in Manchester, Marsden, Darwin, 
Chadwick, Andrade and Niels Bohr were all present, and it was greatly regretted 
that William Kay, Rutherford’s laboratory steward and personal assistant, to 
whom he acknowledged a great debt, did not live to be present at these 
celebrations; he died in Manchester only a few months ago. 

The main commemorative session of the conference consisted of the 
reminiscences of Sir E. Marsden, Sir Charles Darwin and Professor Andrade, 
and this was followed by a ceremony at which honorary degrees were bestowed. 
During the week, delegates saw something of the local Derbyshire scenery, 
visited Jodrell Bank and A.E.I. at Trafford Park, were received by the Lord 
Mayor at a lavish reception in Manchester’s impressive Victorian Gothic Town
Hall, and rounded off the week at a special concert given by Sir John Barbirolli 
and the Hallé Orchestra—the source of another of Manchester’s claims to renown. 

Concurrently with the Conference an exhibition of things associated with 
Rutherford was held—photographs, letters, models and, most interesting of all, 
some of his actual apparatus, including the piece said to have been his ‘pet’, the
superb piece of glass-blowing by Baumbach which made possible the spectral
identification of α-particles as helium. The letters on view gave some interesting
glimpses into the organization and economics behind the scene. There was
Schuster’s letter offering to hand his chair over to Rutherford (then at McGill), 
Rutherford’s answer making careful enquiries about the financial arrangements 
for research, and Schuster’s detailed reply saying how he spent his annual grant 
for teaching and research (all £450 of it!) and by how much it was safe to over- 
spend without getting into trouble. Rutherford was satisfied, and came in 
1907, and thus began the work in ‘Tom Tiddler’s field’, which was how 
Rutherford referred to one of the most celebrated research groups in the history 
of physics. Rutherford owed a considerable debt to Schuster for handing over
to him a well organized and relatively well equipped laboratory and teaching
department. While the glory of discovering the nucleus falls to Rutherford, it 
was entirely owing to Schuster that the work was done in Manchester. 

As early as 1906 Rutherford, then at McGill, had realized, from the observation
that an α-particle beam was spread out slightly by passing through a mica
sheet, that there must be surprisingly large electric fields within atoms, but it
was not until 1911 that the idea of the nucleus was finally conceived. A trivial
defect in an α-beam tube, which was cured empirically by inserting brass washers
to confine the beam, suggested that α-particles were reflected by metals.
Rutherford suggested to Marsden, a second-year student (in those days under- 
graduates were given small research projects as part of their training), that he 
should follow this up. After some initial difficulties, because the available 
α-particle sources were too weak, Marsden eventually obtained a stronger
source and did the experiment which is seen in retrospect to be one of the 
most profitable ever carried out. He directed a beam of α-particles at metal
foils, and observed the range of angles at which they came off. The result was
staggering; although most of the particles were only deflected slightly, a few were 
turned through large angles, and a very few came almost back along their tracks. 
As Rutherford said later, it was as if one fired 15 in. shells at tissue paper, and 
found that occasionally they bounced back! Marsden told Rutherford what 
he had observed, Rutherford questioned him about the experiment to convince 
himself that it was all right, and that was all for several weeks, until a Sunday 
evening in the Autumn of 1911. Rutherford had invited several of his research 
workers to supper at his house in Withington, as he often did, and while they 
were chatting after supper Rutherford suddenly came out with his first ideas 
about the atomic nucleus; before they went home he asked one of them, Darwin, 
to check his hasty derivation of the scattering law to be expected when α-particles
were deflected by point nuclei. They even discussed, on that first evening, the 
idea that, if the nucleus were not quite a point, departures from the law at close 
approach could yield information about nuclear structure. Although Rutherford 
did not live to see powerful enough scattering experiments performed, this is 
now the basis of modern methods of investigating the structure of nuclei and 
nucleons. 

In the months that followed Geiger and Marsden carried out more sophisti- 
cated scattering experiments than the one which had revealed the effect, and 
actually measured the angular distribution of the scattered α-particles. The
results confirmed Rutherford’s scattering law and therefore the validity of the 
assumptions he had made in deriving it, and led in 1913 to a group of three papers 
which laid the foundations of nuclear physics. 

The commemorative session of the conference produced reminiscences about 
several of Rutherford’s group in Manchester; of Moseley whom Sir Charles 
Darwin (who worked with him) described as the hardest-working person he 
had ever known, and who was an expert in finding a meal in Manchester at 
3 a.m.; of Niels Bohr who was a very comforting theoretician with great skill
in bridging the gap between startlingly new theoretical concepts and classical 
ideas; of Robinson, a keen music-hall addict—and indeed of the music-hall 
origin of the correct intonation to Rutherford’s nickname of ‘Papa’.

While these reminiscences of the physics of 50 years ago were appropriate 
and entertaining, it was right that most of the time at the conference should be
concerned with the physics of the present. There were nearly 200 contributed
papers, and for those who want a detailed picture of the present state of nuclear 
physics these papers will shortly be published as a 750-page volume. The 
conference sessions, however, consisted of the presentation of invited papers, 
each intended to summarize a different aspect of the subject. 

Thirty years ago Rutherford said, “It is my personal conviction that if we
knew more about the nucleus, we should find it much simpler than we suppose.
I am always a believer in simplicity being a simple fellow myself.” The subject
at present seems a long way from this simplicity; parts of the conference seemed 
to be in a foreign language, and at one point there were so many rival theories 
that they were referred to by reference numbers. Perhaps we need another 
Rutherford. 

The main topics reviewed included nuclear forces, nuclear structure, and 
the interactions with outside particles from which most of the evidence for 
nuclear properties is obtained. There was also a paper on the limitations and 
possibilities of the instruments for nuclear investigation, and another, rather off 
the main line, on cosmological dating by nuclear methods. 

It has long been understood that the attractive forces between nucleons 
(the neutrons and protons which comprise nuclei) were somehow concerned with 
the interchange of a particle (the π-meson or pion) between them. There has
also been evidence that sometimes two pions are in transit between the interacting 
nucleons at the same time, and the possibility of this occurrence modifies the 
force to be expected; although the theory of this process is still an unsolved 
problem, models describing the resulting behaviour have been proposed. 
What has only recently been confirmed (in fact it was announced at this
conference) is that occasionally three pions at a time are involved. These three
pions may actually be joined together transiently as a compound particle during 
the interchange process; indeed, theoreticians have been invoking a compound 
particle of this type for some time. There now seems to be evidence for its
existence. 

A nuclear model which has been surprisingly long-lived and successful is the 
shell model, which was first proposed 25 years ago. This assumes nucleons
to occupy energy levels, obey quantum-number selection rules, and group 
themselves into closed shells in a manner analogous to the electrons outside the 
nucleus. This theory was given a new lease of life by adding the concept of 
nucleon spin, which undergoes coupling with the nucleon ‘ orbital’ motion. 
The presence of any nucleons in addition to the numbers which comprise closed 
shells will tend to distort the otherwise spherical shape, but these distortions 
were ignored in the approximate treatment of the problem. If there are only 
a few nucleons more (or less) than complete shells the mean distortion is indeed 
small, but the theory has been extended to include vibrations about this mean 
shape. With larger numbers of extra nucleons, mid-way between the numbers 
comprising complete shells, the nucleus is much more distorted, and rotational 
modes become important. With these larger numbers of extra nucleons it is no 
longer practicable to treat them singly and only their collective behaviour is 
considered.

The way nucleons are arranged in a nucleus, and especially in the surface 
regions of heavy nuclei, is another topic of current interest. Some workers 
consider that nucleons tend to be found singly or in pairs in the nuclear surface,
while others believe that there is more than a random chance of their being found
in groups of four, although the grouping may be of a very transitory nature, 
the particles perhaps remaining associated for 10-*? of a second or so. Indeed, 
it is known that if a single particle, say a neutron, hits a nucleus it may result 
in the ejection of an α-particle (an assembly of 2 protons and 2 neutrons).
However there was a vigorous argument at one session of the conference as to
whether this α-particle existed in the nuclear surface and was knocked out by the
neutron, or whether the incident neutron simply collected three more particles
and itself became part of the resulting α-particle. The evidence seems to be in
favour of the former idea—that the four particles were already associated before 
ejection. 

Soon after Rutherford came to Manchester he and Geiger, using Geiger’s 
new α-particle counting techniques, were able to make the first measurements
of the half-lives of radioactive elements. Nearly 20 years later, when Aston 
measured the relative abundances of the isotopes in lead (the end-points of 
radioactive decay series) from a lead-uranium ore, Rutherford realized that this, 
combined with his half-life measurements, could yield estimates both of the age 
of the earth (i.e. the time since solidification) and of the time since the actual 
formation of the heavy elements. Rutherford’s results increased the estimated 
time-scale for the Earth’s development by a factor of more than 10 over the 
currently accepted estimates due to Kelvin, and this advance produced the news- 
paper headline ‘ Doomsday Postponed’. Apart from Rutherford’s assumption 
that the amount of ²³⁵U initially formed was at the most equal to that of ²³⁸U,
modern cosmochronologists would agree with him. It is now believed that
²³⁵U was produced initially in greater abundance than ²³⁸U, and this, plus
minor changes in the accepted values of other constants, pushes the estimated
time since the formation of the heavy elements (loosely called the age of the
galaxy) up from Rutherford’s estimate of 3.4 × 10⁹ years to about 20 × 10⁹ years.
This age is confirmed by observations of clusters of stars associated with our 
galaxy. These are assumed to consist of a homogeneous age group of stars,
and if the absolute brightness of each star is plotted against its temperature, 
then the distribution for the whole cluster shows an abrupt departure from the 
straight line of the main sequence at a point depending on the age of the cluster. 
Results obtained by this method agree roughly with the galactic age estimated 
from nuclear data. 

The time scales considered at the conference thus ranged from nucleon 
association times of 10~* of a second up to the galactic age of over 10¹⁰ years, a
factor of 10**. The conference ended on the serious note of warning that
nuclear physicists had a duty to use any influence they had to ensure that nuclear 
knowledge did not lead to the end of mankind, and with the hope that it would 
in due course be possible to celebrate the Rutherford centenary. 


The Author: 

Dr. Geake graduated at Manchester University and is now a lecturer in physics at 
the Manchester College of Science and Technology. His research interests include 
astronomical and other applications of photoelectric spectrophotometry. 


BOOK REVIEWS 


Physics and Archaeology. By M. J. AITKEN. (New York and London: Interscience
Publishers, 1961.) [Pp. x+181.] 43s.


In recent years there has been more and more application of physical methods to 
archaeology. Subjects like the free precession of protons, radioactive decay, thermo- 
remanent magnetism and x-ray spectrometry, to mention just a few, seem to be becoming 
a normal part of the archaeologist’s vocabulary. A pre-dig survey often uses most up-to- 
date electronic (and transistorized) equipment, and preliminary results have even been 
“processed” by a computer.

Such a proliferation of techniques must be somewhat bewildering to the archaeologist 
without a degree in physics. This book describes some of the ways in which physics has 
found application and is sensibly restricted to techniques which have produced concrete 
quantitative results. Furthermore, the first page states firmly that the need to dig is not 
eliminated, lest the non-scientist should give up heart. 

The book provides a welcome introduction to a rapidly developing field. It should be 
of particular value to the physicist who is developing an interest in archaeology, for all 
topics are outlined clearly and numerous references are given to original material so that 
any specific points may be further investigated. But from the title an archaeologist might 
deduce that he will find the book as helpful as the physicist, and no doubt many will attempt 
it. They are likely to find it rather hard going even if they follow the instructions and omit
Chapter 3. It is hardly possible in so few pages to survey such a wide field and still give 
the non-scientist a clear picture of what is going on. F. R. McKim. 


The Science Study Series, Volumes 5-8. (London: William Heinemann Ltd., 1961.) 


Ever since Science News, we have suffered from the lack of well written monographs 
which may easily be read by the schoolmaster, the student or the layman. A partial 
reincarnation seems to have occurred in the form of the paperback Science Study Series, 
published in Britain by Heinemann in 1961, written in the United States for the now famous 
Physical Science Study Committee. Four books have already appeared over here; the 
group under review is the second batch of four. 


Waves and the Ear, by W. A. VAN BERGEIJK, J. R. PIERCE and E. E. DAVID, Jr., 5s.,
is an account of the physics of sound and of waves, woven cleverly into strands of physiology,
zoology and psychology. The wide experience of these authors has been used to illustrate
their arguments with unusual and diverse information. Thus the nature of sound waves
and their behaviour and measurement is explained with the help of descriptions of the
hearing of bats, of recent findings about fishes, of the vocal sounds of frogs and crickets.
The ‘volley’ theory of human hearing and the ‘Ohm’s Law’ of sound are competently
described together with an account of the physiology of hearing and the action of nerves. 
The style is informal, but it has the merit of interweaving many strands of science and of 
disclosing alternative theories and conflicting evidence in ways which should reduce 
dogmatic interpretation and stimulate further queries. 


The Physics of Television, by D. G. FINK and D. M. LUTYENS, 5s., is somewhat
more authoritative in style and less spiced with interesting tit-bits from physiology than is
its counterpart on sound. It should help to explain the technology of television to the
ordinary reader.


Crystals and Crystal Growing, by A. HOLDEN and P. SINGER, 10s. 6d., is a description
in simple terms of the nature of solids and solutions. Concepts such as the ‘increase in
disorder’ are used from the start, and recipes for crystal growing are given, with some hints for
the chemist who wishes to deal with solubilities or phase studies. Diagrams and photographs
illustrate the spatial geometry, and on the whole these are good. Perhaps the most
interesting chapters are those which deal with twinning, piezo-electric effect, double
refraction and optical isomerism. A useful reference book for the school chemistry
laboratory.


In The Birth of a New Physics, (5s.), I. BERNARD COHEN has marshalled the observations 
and arguments of the past to contrast with those of today. This has been done in order 
to show the ways in which modern physics developed out of the past and how many of 
us are still bound by the thinking of the ancients by our very word forms. The other
aspect of interest which I, for one, find intriguing, is the extent to which the men of antiquity 
were able to understand their problems in physics and to describe them. Perhaps the 
nuclear physics or the biochemistry of today have still to look for their Galileos, Keplers 
and Newtons, and if this book tends to reduce the dogma and authority which too many 
people today ascribe to science, it will be doing a very great service. D. S. FENSOM.


Space and the Atom. Selected lectures edited by S. T. BUTLER and H. MESSEL. (Sydney:
Shakespeare Head Press, 1961.) [Pp. 425.] 84s.

This volume is a collection of selected lectures from the fourth Summer School for 
Science Teachers given at Sydney in January this year. 

Over half the book is devoted to a basic course (given by the editors), on atomic and 
nuclear physics, electromagnetic radiation, and elementary relativity theory, ending with 
chapters on nuclear reactions and particle accelerators. 

The style is interesting and the material is presented in a logical manner, suitably 
illustrated and divided into easily assimilated units. The text is marred, however, by
occasional misprints and unnecessary references to another volume. (Anyone requiring the 
references is unlikely to follow much of the remaining text.) 

Professor Salpeter contributes an admirable introduction to quantum mechanics. A 
section on elementary particles by C. B. A. McCusker starts with stable particles and leads 
to a discussion of strangeness and force fields. 

W. Boas, on solid state physics, introduces the properties of crystals and polycrystalline 
substances before going on to the conduction of electricity in metals and semiconductors. 
His illustrations are good, and the electron micrographs of dislocation are enlightening; 
but fig. 17, p. 335, is incomprehensible to anyone without previous experience of such 
photographs. 

The lectures on ‘Space’ in the last eighty pages deal mainly with radio-astronomy
techniques, including E. G. Bowen on the 210 ft Australian telescope, followed by an account 
of the resolution and sensitivity of radio telescopes by B. Y. Mills. The book concludes with 
a section on aeronautical theory by P. T. Fink. 

Although the sections vary in standard the book is stimulating and undoubtedly of 
value to students and science teachers for whom it is intended, especially if use is made of 
the bibliography. The editors, in proposing that parents too might benefit, are probably
rating the scientific education of the last generation too highly. J. CAREW-JONES.


Laboratory Physics. By J. H. AVERY and A. W. K. INGRAM. (London: Heinemann, 1961.)
[Pp. 513.] 25s.

The general impression created by this book is very good; it is well set out, gives the 
right basic ideas and covers a wide choice of experiments. 

A difficulty in practical lessons is to strike a balance between telling a pupil too much 
and too little. In the former case, instructions become in the nature of a cookery recipe. 
In the latter case a pupil may flounder. The heuristic method is just not applicable to
modern syllabuses with each pupil doing a different experiment. This book, as in previous 
editions, strikes the balance very well. The slower pupil need not be left behind while the
able one has his interest stimulated. 

In any book of this range there are bound to be a few points which a teacher might 
query. In the viscosity experiment the need for a slow rate of flow is not emphasized. In 
finding the coefficient of expansion by balancing columns one would prefer both columns 
to be jacketed. In the conductivity experiment with glass tubing, the slow pupil might 
need more assistance with his calculation. In photo-electric experiments, difficulties 
often arise causing striking at varying potentials and a shield round the lamp would give 
more consistent readings. 

These are all minor points, however, and a careful reading of this new edition convinces 
one that it is the best practical physics book at this level which has yet been published. 

C. R. WARD.


Physics Applied to Archaeology—Part I 


by M. J. AITKEN 


Research Laboratory for Archaeology and the History of Art, 
University of Oxford 


1. INTRODUCTION 


An archaeologist faces the task of reconstructing the social, economic and 
historic aspects of a past civilization from those few material objects which happen 
to have survived. From these he must extract the utmost, and having used his 
sight, touch and intuition, he calls upon the natural and biological sciences for 
further information. These two articles outline the part played by physics. 
The first covers methods of age determination and methods of chemical analysis 
using physical techniques. The second article will describe an additional 
method of age determination—magnetic dating—and methods of locating buried 
remains, notably by means of a proton magnetometer. 


2. RADIOCARBON DATING 


This is of predominant importance in prehistoric archaeology because there 
is good reason to believe that, within limits, it offers an absolute time scale, 
dependent only on the radioactive decay of carbon-14. The method arose as a 
by-product of fundamental research, and was developed by W. F. Libby from 
1946 onwards (Libby 1955, 1961). When cosmic rays enter the earth’s atmos- 
phere neutrons are produced. From laboratory studies of possible nuclear 
reactions using artificially produced neutrons Libby concluded that nearly all 
the cosmic-ray neutrons would end their lives by converting atmospheric 
nitrogen (¹⁴N) into radiocarbon (¹⁴C), the neutron cross section of oxygen being
negligible by comparison with that of nitrogen. 

Because the chemical behaviour of radiocarbon is the same as that of ordinary 
carbon (¹²C) it forms heavy carbon dioxide and mixes in with the ordinary carbon
dioxide of the atmosphere (present to about 0.04% by weight). Since plant life
grows by photosynthesis of atmospheric carbon dioxide and animals live off
plants, all the animal and vegetable world (the biosphere) is very weakly radioactive.
Atmospheric carbon dioxide also enters the ocean, as dissolved carbonate,
and this too is weakly radioactive, as are any shells and deposits formed from it.

The disintegration rate of carbon-14 corresponds to decay by 1% every
82 years, equivalent to a half-life of 5800 years. The estimated production
rate of carbon-14 by cosmic-ray neutrons is 7 or 8 kg/year. These two figures
dictate the equilibrium amount of carbon-14 on earth, that is, the amount for which
the loss by decay is just balanced by creation in the atmosphere. This amount
is in fact (7.5 × 82 × 100) kg, i.e. about 62 metric tons. Since the total amount of ordinary
carbon in which the carbon-14 is diluted is about 40 million million tons (the
atmosphere, the biosphere, and the oceans), the ratio of carbon-14 atoms to
carbon-12 atoms is only 1 to 0.8 million million.
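The equilibrium arithmetic above can be retraced in a few lines. This is only a sketch using the rounded figures quoted in the text; the variable names are illustrative, and the atomic masses of 12 and 14 used to turn a mass ratio into an atom ratio are an assumption added here.

```python
# Figures quoted in the text (all rounded).
production_kg_per_year = 7.5       # "7 or 8 kg/year"
years_per_percent_decay = 82.0     # 1% of the carbon-14 decays every 82 years
carbon_reservoir_tons = 40e12      # "40 million million tons" of ordinary carbon

# Losing 1% every 82 years is a fractional loss of 0.01/82 per year,
# so the mean life is 82 * 100 years.
mean_life_years = years_per_percent_decay * 100.0

# At equilibrium the yearly loss by decay equals the yearly production:
#   inventory / mean_life = production
inventory_kg = production_kg_per_year * mean_life_years
inventory_tons = inventory_kg / 1000.0

# Convert the mass ratio to an atom ratio (assumed atomic masses 14 and 12).
mass_ratio = inventory_tons / carbon_reservoir_tons
atom_ratio = mass_ratio * 12.0 / 14.0
carbon12_atoms_per_carbon14 = 1.0 / atom_ratio

print(f"equilibrium carbon-14: {inventory_tons:.1f} metric tons")
print(f"carbon-12 atoms per carbon-14 atom: {carbon12_atoms_per_carbon14:.2e}")
```

With these inputs the inventory comes to 61.5 tons and the atom ratio to a little under 1 in 0.8 million million, in line with the rounded values in the text.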

Carbon freshly drawn from the atmosphere contains this equilibrium 
proportion of carbon-14. But in dead wood, for example, the carbon-14 atoms
lost by radioactive decay are not replaced and the radiocarbon concentration
slowly decreases, by 1% every 82 years. Thus by measuring the ¹⁴C/¹²C ratio
for a piece of dead wood and comparing it with the ratio for a piece of living wood, 
the age of the dead wood can be calculated. The calculation is based only on 
the laboratory measured decay-rate of carbon-14, but the whole principle of the 
method rests on the assumption that the ¹⁴C/¹²C ratio for living plant and animal
life has always been the same as it is today. The validity of this assumption 
will be discussed later but early confirmation at any rate of its approximate validity 
was obtained by Libby and his colleagues by measurements on wood from the 
tomb of the Egyptian King Sneferu, estimated from historical records to have died
between 2700 and 2550 B.C. Figure 1 shows how this measurement and
others made on samples of known age fit on to the decay-curve of carbon-14. 
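The age calculation described above follows from exponential decay: each halving of the ratio relative to living material corresponds to one half-life. A minimal sketch, assuming the 5800-year half-life quoted in the text and that the living ratio has always had its present value; the function name is hypothetical.

```python
import math

HALF_LIFE_YEARS = 5800.0  # half-life quoted in the text


def radiocarbon_age(sample_ratio, living_ratio, half_life=HALF_LIFE_YEARS):
    """Age in years from the 14C/12C ratios of dead and living material.

    Assumes the living ratio has always been the same as it is today,
    which is the assumption the text says the method rests on.
    """
    if sample_ratio <= 0 or living_ratio <= 0:
        raise ValueError("ratios must be positive")
    # Number of half-lives elapsed is log2(living / sample).
    return half_life * math.log2(living_ratio / sample_ratio)


# A sample with half the living ratio is one half-life old.
print(radiocarbon_age(0.5, 1.0))
```

Only the ratio of the two measurements enters, so the units in which they are expressed do not matter as long as both are the same.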


[Figure 1: specific radioactivity (disintegrations per minute per gram of natural carbon) plotted against historical age (years, 0 to 10 000), with the theoretical curve.]


Fig. 1. Activity of samples of known age (after Libby 1955). The horizontal width of 
each mark represents the uncertainty in historical age and the vertical height repre- 
sents the statistical uncertainty in the counting-rate (one standard deviation). 
The theoretical, exponential decay is shown by the solid curve. (Reproduced from 
Physics and Archaeology by courtesy of Interscience Publishers Ltd.) 


It was also confirmed that the ¹⁴C/¹²C ratio for living material is independent of
latitude. This needed checking because, owing to the magnetic field of the earth,
the cosmic-ray neutron production over the equator is only about one quarter of
that over the poles. However, atmospheric circulation produces sufficient mixing
to remove any significant latitude dependence in the ¹⁴C/¹²C ratio.


Measurement 

In living material the ¹⁴C/¹²C ratio is only 1 in 0.8 million million, and in
older samples it is less by a factor of two for each 5800 years. Measurement of
this ratio by a mass spectrometer is out of the question because of the nearness of
the atomic weights. However, it is possible to measure the ratio by determining
the specific radioactivity of a given weight of carbon extracted from the sample.
Carbon-14 decays back to stable nitrogen-14 with the emission of a β-particle:

$$^{14}\mathrm{C} \rightarrow {}^{14}\mathrm{N} + \beta^{-}$$

Unfortunately the energy of the β-particles is only 160 keV. This means that
they would be seriously attenuated by a 1 in. thickness of air or by a 0.0003 in.
thickness of aluminium. So, for the β-particles to be detected by a nuclear
counter, the sample must be introduced inside the counter itself. There are 
several ways of doing this; one of the best is to convert the carbon from the 
sample into carbon dioxide and use it, after very careful purification, as the 
counting gas in a proportional counter. Conversion to methane and to acetylene 
has also been used with success and also synthesis into an organic liquid 
suitable for use as scintillator with a photomultiplier. 

The 14C/12C ratio for living matter, 1 in 0.8 million million, corresponds to
the emission of 15 β-particles/min/gram of natural carbon, i.e., just under 7
micromicrocuries/g. The disintegration rate for samples which are, say, 23 000
years old (four half-lives) is one-sixteenth of this. The volume of a typical
gas-proportional counter is about one litre and the gas pressure is several
atmospheres. Thus the counting rate from a 23 000 year-old sample will not be more
than one or two counts per min. Consequently reduction of the counter
background is of the utmost importance. Part of this arises from materials of the
counter, but most of the background is due to secondary cosmic-rays. The
γ-ray component is reduced by an 8-inch thick screen of steel. Massive though
it is, this does not stop the meson component. The mesons are eliminated by 
means of an anti-coincidence shield; this consists of a ring of Geiger counters 
placed around the measuring counter. Any meson that reaches the measuring 
counter should have also produced a pulse in one of the Geiger counters and it 
is arranged electronically that a pulse from the measuring counter occurring 
simultaneously is disregarded. 
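
The arithmetic behind the counting rate quoted above can be checked with a short sketch. It uses the figures given in the text (15 disintegrations per minute per gram of modern carbon, a half-life of 5800 years, a one-litre counter) together with two assumptions that are not in the text: a filling pressure of 3 atmospheres, standing in for "several", and ideal-gas behaviour of the carbon dioxide at 0 °C. The function names are illustrative only.

```python
import math

HALF_LIFE_YEARS = 5800.0     # half-life used in the text
MODERN_ACTIVITY = 15.0       # disintegrations/min per gram of modern carbon (from the text)
MOLAR_VOLUME_LITRES = 22.4   # ideal gas at 0 degrees C and 1 atm (assumption)


def specific_activity(age_years):
    """Disintegrations/min per gram of carbon for a sample of the given age."""
    return MODERN_ACTIVITY * 0.5 ** (age_years / HALF_LIFE_YEARS)


def age_from_activity(activity):
    """Invert the decay law: age in years from the specific activity."""
    return HALF_LIFE_YEARS * math.log2(MODERN_ACTIVITY / activity)


def grams_of_carbon(volume_litres, pressure_atm):
    """Mass of carbon in a counter filled with CO2 (12 g of carbon per mole)."""
    return 12.0 * volume_litres * pressure_atm / MOLAR_VOLUME_LITRES


carbon = grams_of_carbon(1.0, 3.0)     # 3 atm is an assumed value
old_age = 4 * HALF_LIFE_YEARS          # four half-lives, as in the text
rate_old = specific_activity(old_age) * carbon
print(f"{carbon:.2f} g of carbon, {rate_old:.2f} disintegrations/min")
```

Under these assumptions the counter holds rather more than one and a half grams of carbon, and the old sample yields between one and two disintegrations per minute, in line with the estimate in the text.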

Even after all these precautions it is difficult to reduce the background much 
below one-tenth of the counting-rate for modern carbon. Consequently for old 
samples the sample counting rate is only slightly higher than background, and 
counting times of at least 24 hours are necessary in order to reduce the effects 
of statistical fluctuations in the counting-rate. 
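
Why such long counting times are needed can be illustrated with a minimal sketch of the counting statistics. It assumes Poisson fluctuations, a background measured separately for the same length of time as the sample, and illustrative rates (1.5 counts/min from the sample, 2.4 counts/min of background) that are not taken from the text. The age uncertainty is obtained to first order as the mean life multiplied by the relative uncertainty in the net rate.

```python
import math

MEAN_LIFE_YEARS = 5800.0 / math.log(2)   # mean life from the half-life in the text


def net_rate_sigma(sample_rate, background_rate, minutes):
    """Standard deviation (counts/min) of the net sample rate.

    Assumes Poisson counting, with the sample (plus background) and the
    background alone each counted for the same number of minutes.
    """
    gross_rate = sample_rate + background_rate
    return math.sqrt(gross_rate / minutes + background_rate / minutes)


def age_sigma_years(sample_rate, background_rate, minutes):
    """First-order age uncertainty: mean life times relative rate uncertainty."""
    relative = net_rate_sigma(sample_rate, background_rate, minutes) / sample_rate
    return MEAN_LIFE_YEARS * relative


# Illustrative rates (assumptions): an old sample giving 1.5 counts/min over a
# background of 2.4 counts/min, one-tenth of a 24 counts/min modern rate.
one_hour = age_sigma_years(1.5, 2.4, 60)
one_day = age_sigma_years(1.5, 2.4, 24 * 60)
print(f"1 hour: +/-{one_hour:.0f} years; 24 hours: +/-{one_day:.0f} years")
```

The uncertainty falls as the square root of the counting time, so a 24-hour count is nearly five times more precise than a one-hour count.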


Suitable Samples 

The event which is dated is the cessation of intake of carbon from the atmos- 
phere. Charcoal and well-preserved wood are highly reliable types of sample, 
because exchange of carbon from the big cellulose molecules is unlikely once 
photosynthesis has stopped. This is shown by measurements on inner tree-rings:
the radiocarbon activity corresponds to the date of formation despite the
presence of carbon of different activity in adjacent rings. Samples of wood and
charcoal derived from the inner rings of a slow-growing tree (such as oak or
redwood) would therefore yield a radiocarbon date several hundred years older than
the archaeological event of felling the tree.

Peat, leaves, nuts, hair, skin, leather, paper, cloth, dentine, charred bone and 
antler can also be dated. In all samples the greatest care has to be taken to 
exclude contamination with ‘modern’ carbon. For a sample that is 23 000 


164 M. J. Aitken 


years old the addition of 1% of modern carbon increases the specific activity of 
the sample by 16%, corresponding to an underestimation of its age by 1300 years. 
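
The effect of contamination can be reproduced with a small sketch. It follows the simple reckoning implied by the text, in which modern carbon amounting to 1% of the sample carbon is added without renormalising the total mass, and it uses the 5800-year half-life quoted earlier. The function names are illustrative.

```python
import math

HALF_LIFE_YEARS = 5800.0   # half-life used in the text


def activity_fraction(age_years):
    """Specific activity relative to modern carbon."""
    return 0.5 ** (age_years / HALF_LIFE_YEARS)


def apparent_age(true_age_years, modern_fraction):
    """Age inferred after adding modern carbon to an old sample.

    modern_fraction is the added modern carbon expressed as a fraction of the
    sample carbon; the small change in total mass is ignored (assumption).
    """
    measured = activity_fraction(true_age_years) + modern_fraction
    return -HALF_LIFE_YEARS * math.log2(measured)


true_age = 4 * HALF_LIFE_YEARS                       # about 23 000 years
increase = 0.01 / activity_fraction(true_age)        # fractional rise in activity
shift = true_age - apparent_age(true_age, 0.01)      # underestimation of the age
print(f"activity up by {increase:.0%}, age underestimated by {shift:.0f} years")
```

The 16% rise in activity is recovered exactly. The age shift comes out at a little over 1200 years with this half-life, of the same order as the rounded figure of 1300 years given above.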


Some Applications

Written history goes back only 5000 years. That is in Egypt, where the first 
calendar was established—based on the flooding of the Nile and on the rising 
of the bright star Sirius (Sothis). Elsewhere in the ancient world reliable 
calendars were developed much later. However, trade and war between 
different countries meant that the distinctive styles of pottery, ornament, weapon 
etc. of one country permeated into another so that contemporaneity between the 
stratigraphical levels of widely separated sites can be established and the relative 
chronologies linked together. The relative chronologies are eventually made 
absolute by carrying the process into a country, such as Egypt, for which a 
recorded calendar exists. A notable example is the ‘dagger’ carving observed
on one of the Sarsen stones of the third phase of Stonehenge; this is of a form
paralleled only in the Mycenaean shaft graves of southern Greece associated with
the legendary figure of Agamemnon, and dated 1600-1500 B.c.

One valuable rôle of radiocarbon dating is to check the validity of such
associative inferences: in fact the radiocarbon date obtained for an antler
recently found under one of the Stonehenge sarsens was 1730 (±150) B.c.
Radiocarbon dates have not always been in line with archaeological ideas, however.
Until a year or two ago the accepted archaeological date for the beginning of the
Neolithic Age (in which settled farming replaced hunting as a means of livelihood)
was roughly 2000 B.c. Repeated radiocarbon measurements on charcoal
from this period have consistently given dates of around 3000 B.c., necessitating
a reappraisal of the archaeological evidence. However it is prior to the beginning
of the Egyptian calendar that radiocarbon dating becomes really vital to prehistoric
archaeology. Measurements on two charcoal samples from the Mesolithic
(i.e. pre-Neolithic) levels at Jericho have given dates of 7900 (±240) B.c. and 7870
(±240) B.c. This is surprisingly early and indicates that man’s first steps
towards an urban civilization, once the climate had improved after the last
Ice Age, took place remarkably quickly.

The chronology of the final stages of the last Ice Age itself, both in North- 
Western Europe and in North America, has been studied in considerable detail 
by means of radiocarbon measurements (Godwin 1960). This is in association 
with analysis of the pollen grains left behind by the trees, shrubs and plants of 
the period; the types which are most abundant give an indication of the prevailing 
climatic conditions. It appears that the Ice Age proper finished around 10 000 
B.C. all over North-Western Europe, but that this was a false dawn as it were, 
for from 8800 to 8300 B.c. cold conditions had returned. The evidence suggests 
that the start and finish of this final cold period were sufficiently rapid for notice- 
able climatic changes to occur in the space of a man’s lifetime. In central 
Sweden the final retreat of the glaciers had already been dated at 8100 B.c. by 
de Geer’s technique of varve-counting (see later) and the agreement with the 
radiocarbon datings is highly satisfactory to both methods. Elsewhere, and 
particularly in North America, radiocarbon measurements give 9000 B.c. as the
start of a sudden and large increase in average temperature; the samples measured
have included cores from the bed of the Atlantic Ocean, and the isotopic ratio
18O/16O has been used as a measure of temperature, in addition to botanical indications.




Radiocarbon dating has commercial application too. In oil prospecting it is 
sometimes difficult to decide whether oil found near the surface originates from 
a small recently-formed pocket or whether it has seeped through from a large, 
deep-lying, old deposit. Radiocarbon measurements provide the answer. 

The limiting age that can be reached is about 70 000 years. This requires
artificial enrichment of the carbon-14 relative to the carbon-12, prior to measure- 
ment, by means of thermal diffusion columns. Without artificial enrichment, 
the limit is about 50 000 years. 


Sources of Systematic Error 

Radiocarbon dating proceeds on the assumption that the 14C/12C ratio for
living organic matter has always been the same. Consequently, once the
half-life of 14C has been determined by precise laboratory measurement of its
disintegration-rate, the radiocarbon chronology is an absolute one, independent 
of any necessity to calibrate with samples of known age. There are however a 
number of reasons why the basic assumption might be wrong, and consequently 
radiocarbon measurements on samples of known-age are very desirable in estab- 
lishing the accuracy of the assumption. The agreement with varve-counting 
and Egyptian dates is sufficiently close to accept the assumption as a good 
approximation—within 5% say. 

The most obvious cause for a variation in the 14C/12C ratio is a change in
the cosmic-ray intensity. Aside from any cosmic influences (which would not
be expected to manifest themselves within a mere fifty thousand years) there
is the possibility that the strength of the earth’s magnetic field has changed.
Charged particles approaching the earth are deflected away from it by the
magnetic field and for any given latitude there is a certain minimum energy
below which a cosmic-ray particle cannot reach the earth. If the field was
stronger in the past then this cut-off energy was higher and the 14C
production-rate was lower. There is evidence
from the strength of the thermo-remanent magnetism in ancient bricks and
pottery (to be discussed in the next article) that 2000 years ago the earth’s
magnetic field was nearly twice as strong (Thellier and Thellier 1959). Elsasser,
Ney and Winckler (1956) have estimated that if the magnetic field had been
constant up to 2000 years ago, thereafter following an exponential decrease
to its present day value, the radiocarbon age deduced (ignoring the effect) for a
sample of true age 2000 years, would be 2240 years. If, on the other hand,
the exponential fall had started 4000 years ago, then the overestimation would be
much more serious: by 1000 years for a sample of true age 4000 years.

Precise measurements on known-age samples now extend back to 1900 B.c. 
(Ralph and Stuckenrath 1960) and the results exclude the latter possibility, but 
not the former. It seems possible therefore that the change in the magnetic 
field was part of a fluctuation and that the value averaged over several thousand 
years (the duration that would be necessary to affect the 14C/12C ratio appreciably)
has stayed constant. 

Detailed measurements have also been made on wood from tree-ring sequences
over the past 1300 years (e.g. Willis, Tauber and Munnich 1960) and these
indicate a fluctuation of 1 or 2% in the living 14C/12C ratio (corresponding to a
dating error of 80-160 years). Sharp changes in the cosmic-ray intensity due
to sun-spot activity are one possible cause. Another is variation in the
mixing-rates between different parts of the carbon exchange reservoir; for
example, the sudden upwelling at the surface of deep ocean water that had become
deficient in 14C could cause a temporary depression in the 14C content of the
atmosphere.
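
The correspondence between a fluctuation in the living 14C/12C ratio and a dating error follows directly from the exponential decay law. A minimal sketch, assuming the 5800-year half-life used in this article and an illustrative function name:

```python
import math

MEAN_LIFE_YEARS = 5800.0 / math.log(2)   # mean life from the half-life in the text


def age_error_years(ratio_change):
    """Dating error caused by a fractional change in the initial 14C/12C ratio."""
    return MEAN_LIFE_YEARS * math.log(1.0 + ratio_change)


for change in (0.01, 0.02, 0.05):
    print(f"{change:.0%} change in ratio: about {age_error_years(change):.0f} years")
```

Changes of 1% and 2% give errors of roughly 80 and 160 years, as stated above; the 5% figure is included only to show the scale of the error implied by the overall accuracy of the basic assumption.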

Two recent effects are interesting. In coal and oil all the 14C has decayed
back to 14N and their combustion since the beginning of the industrial revolution
had released sufficient 14C-free carbon into the atmosphere by 1954 to depress the
14C/12C ratio by approximately 2%. This necessitates deducing the figure for
the specific radiocarbon activity of living matter from measurement on wood
of known age grown prior to 1850. The ‘fossil-fuel’ effect has now been
overtaken by production of 14C in hydrogen-bomb tests. The 14C/12C ratio of
atmospheric carbon dioxide had risen to 25% above normal by 1959. As with
other forms of H-bomb fall-out from the stratosphere, there is a delay before the
maximum effect is felt at ground level and even if no more H-bombs are exploded*
the rise is expected to continue, reaching a maximum of 30-40% sometime before
1963.

Apart from demanding additional precautions during measurement, the 
bomb-produced 14C does not affect archaeological dating. Any sample which
is still exchanging its carbon with the atmosphere is useless anyway. 


3. THERMOLUMINESCENT DATING


Suitable material for radiocarbon dating is not always easy to come by on an 
archaeological site and apart from charcoal its preservation is very dependent 
on the climate. Pottery fragments on the other hand are more plentiful and 
practically indestructible. Thus the technique of thermoluminescent dating of
pottery, at present under development by Professor G. Kennedy at the University 
of California, is of great importance. 

Uranium and thorium are present in clay as trace impurities—the amount 
varies but is usually of the order of a few parts per million. Alpha-particle 
emission from these elements causes radiation damage in the clays, i.e. lattice 
defects are created in the crystal structure of the constituent minerals. Electrons 
are trapped in these defects and if the damage is annealed subsequently, by 
heating the clay to several hundred degrees centigrade, the release of the trapped 
electrons is accompanied by the emission of a minute amount of visible light. 
The amount is proportional to the number of defects and it can be detected and 
measured by means of a specially selected low-noise photomultiplier and D.C. 
amplifier. In pottery the act of baking has released all previous damage. 
Consequently the thermoluminescent output is proportional to the time that has 
elapsed since baking, i.e. the age of the piece of pot. In practice it is found 
necessary to irradiate the pottery with x-rays before measurement; during the 
course of centuries the electrons originally in the traps have escaped by wave- 
mechanical penetration, and it is necessary to re-fill them. 

To deduce the actual age two subsidiary measurements are necessary, 
firstly the specific alpha-activity of the fragment concerned and secondly its 
susceptibility to the acquisition of radiation damage—by exposure to a beam of 
ten-million-volt protons for example. An unknown factor still remains: the 
relative effectiveness of protons and alpha-particles in producing damage. This 
is deduced by measurements on a well-dated pot and the assumption is made 
that the ratio is the same for all clay. 
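
The way the three measurements combine can be sketched as follows. The model assumed here is the simplest one consistent with the description above: the light output is taken to be proportional to the age, to the specific alpha-activity and to the susceptibility, with a single constant k (the relative effectiveness factor) fixed by a pot of known age. All numbers are hypothetical and in arbitrary units.

```python
def calibration_factor(light_output, alpha_activity, susceptibility, known_age_years):
    """Constant k deduced from a well-dated pot.

    Assumed model: light_output = k * susceptibility * alpha_activity * age.
    """
    return light_output / (susceptibility * alpha_activity * known_age_years)


def thermoluminescent_age(light_output, alpha_activity, susceptibility, k):
    """Age in years of a fragment, using the calibrated constant k."""
    return light_output / (k * susceptibility * alpha_activity)


# Hypothetical readings: a calibration pot 2000 years old, then an unknown sherd.
k = calibration_factor(light_output=400.0, alpha_activity=2.0,
                       susceptibility=0.1, known_age_years=2000.0)
age = thermoluminescent_age(light_output=450.0, alpha_activity=3.0,
                            susceptibility=0.1, k=k)
print(f"estimated age: {age:.0f} years")
```

The sketch makes the dependence on the calibration explicit: any error in the assumption that k is the same for all clays passes directly into the age.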


* This was written before the Russian tests of September 1961. 




4. DATING BY COUNTING 

The techniques of radiocarbon dating and thermoluminescent dating are 
analogous to finding the height of a staircase by use of a barometer. Alternatively 
one counts the number of steps and multiplies by the unit height. Comparable 
with this are dating by tree-ring counting (dendrochronology), varve-counting, 
and the recently developed glass-layer counting. 

Counting the annual growth rings in the cross-section of a tree is a very
obvious dating method and the first archaeological application was made in 1811:
De Witt Clinton, when examining the earthworks near Canandaigua in New York
State, counted the rings in the trees growing on them and estimated that they
were one thousand years old and hence not the work of Europeans or present-day
Indians. Wet summers produce thick rings and dry summers narrow ones.
Consequently a distinctive sequence in annual rainfalls gives rise to a distinctive
sequence in the thickness of the annual rings. This makes it possible to carry
the dating back through several generations of trees; starting with a
recently-felled tree, a distinctive group of early rings is matched to the
corresponding group in an earlier tree and so on.
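
The matching of ring sequences can be illustrated with a minimal sketch. It slides a short sequence of ring widths along a longer master sequence and picks the position where the correlation between the two patterns is highest. This is one simple way of doing it, not necessarily the procedure used by dendrochronologists, and the ring widths are invented for the illustration.

```python
import math


def pearson(xs, ys):
    """Correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def best_match(master, sample):
    """Return (offset, correlation) for the best alignment of sample in master."""
    best = None
    for offset in range(len(master) - len(sample) + 1):
        window = master[offset:offset + len(sample)]
        r = pearson(window, sample)
        if best is None or r > best[1]:
            best = (offset, r)
    return best


# Invented ring widths (mm) for a master sequence of 30 years.
master = [1.2, 0.8, 1.5, 1.1, 0.6, 0.9, 1.4, 1.3, 0.7, 1.0,
          1.6, 0.5, 1.1, 1.7, 0.4, 0.9, 1.3, 0.8, 1.5, 0.6,
          1.0, 1.2, 0.7, 1.4, 0.9, 1.1, 0.5, 1.6, 0.8, 1.3]
# A second tree sharing years 12 to 19, but growing more slowly overall.
sample = [0.8 * width + 0.1 for width in master[12:20]]

offset, r = best_match(master, sample)
print(f"best alignment at ring {offset} (correlation {r:.3f})")
```

Correlation is used rather than the absolute widths because two trees rarely grow at the same rate; it is the pattern of thick and thin rings that is shared.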

Similar to tree-rings are the annual varves (i.e. layers) found in clays that 
originated in the beds of lakes dammed-up by glaciers. During the summer 
melting is more rapid than in winter and the sediment deposited is coarser and 
thicker. Varve sequences are recognisable in the same way as with tree-rings 
and by working backwards with matched sequences Baron de Geer was able to 
arrive at the absolute date of 8100 B.c. for the end of the last Ice Age—the 
agreement with radiocarbon dating has already been noted. 

The colourful iridescence often exhibited by old glass has been the subject 
of study for some time, beginning with Brewster in 1863. The colours arise 
from interference phenomena in surface crusts between 0.0003 mm and 0.015 mm
thick. A recent study of the crusts on specimens of known history by Brill and 
Hood (1961) of the Corning Glass Museum in New York has shown that these 
crusts are annual, like tree-rings and varves. The mechanism suggested is 
the selective leaching of the alkali oxides leaving a hydrated residue of high 
silica content. Variation in temperature or availability of water, such as would 
occur seasonally, causes an irreversible change in the siliceous residue. Glass 
of high soda or potash content is particularly subject to such deterioration, and 
the process is favoured by burial in wet conditions. 


5. SOME OTHER DATING METHODS 

Geological 

By successive stages of radioactive decay uranium-238 forms lead-206. 
The effective half-life is 4560 million years. By measuring the amount of lead 
relative to uranium-238 in a rock, the time of formation can be deduced—the 
older the rock the greater the lead content. There are several such techniques 
used in geology but since the radioisotopes concerned are naturally occurring 
elements present in the earth’s crust at its formation, any that might have had a 
half-life comparable with archaeological ages will have disappeared long ago. 
There is however one geological technique that may be of archaeological interest: 
the decay of potassium-40 to argon-40. The half-life is 1330 million years, 
but with very sensitive measuring techniques it has been possible to measure the 
amount of argon formed in volcanic lavas which are only a few hundred thousand 




years old. This method would extend absolute dating backwards from the limit 
of radiocarbon at 50 000 years and cover a period of great importance anthro- 
pologically. Recently the technique has been used at the University of California 
to obtain a date for the fossilized remains of Zinjanthropus (nicknamed the
Nutcracker Man on account of his massive teeth and jaws), found by Dr. Leakey 
in the Olduvai gorge in 1959. The age obtained is 1 750 000 years—about a 
million years older than previous estimates. However this revised age is not 
in fact discordant with the growing belief that the Pleistocene period started 
very much earlier than had been accepted in the past. 
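
The uranium-lead relation mentioned at the start of this section can be made explicit with a short sketch. It assumes that the rock contained no lead-206 when it formed and has neither gained nor lost uranium or lead since, and it uses the effective half-life of 4560 million years quoted above. The function name is illustrative.

```python
import math

URANIUM_238_HALF_LIFE_YEARS = 4560e6   # effective half-life quoted in the text


def uranium_lead_age(lead_per_uranium):
    """Age in years from the present-day atom ratio of lead-206 to uranium-238.

    Assumes no lead-206 at formation and a closed system since then.
    Each decayed uranium atom becomes one lead atom, so Pb/U = exp(lambda*t) - 1.
    """
    decay_constant = math.log(2) / URANIUM_238_HALF_LIFE_YEARS
    return math.log(1.0 + lead_per_uranium) / decay_constant


for ratio in (0.0002, 0.1, 1.0):
    print(f"Pb/U = {ratio}: about {uranium_lead_age(ratio) / 1e6:.1f} million years")
```

A ratio of one lead atom per uranium atom corresponds to exactly one half-life. The very small ratio needed for an age of about a million years shows why such long-lived isotopes are of limited use on archaeological time-scales.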

[Note in proof—results now obtained for the basalt underneath the fossil bed 
give an age of only 1,300,000 years]. 


Dating of bone 

Fluorine and uranium are gradually acquired by bone through alteration of 
the phosphatic mineral (hydroxyapatite) of which bones are mainly composed. 
Unfortunately the rate of acquisition depends on the amount of those elements in 
the ground-water which has percolated through the deposits and the same 
percentages of fluorine or uranium in bones buried under different conditions 
do not mean that the bones are of the same age. However, relative dating of 
bones found in the same deposit is possible and such techniques were used in 
showing that the Piltdown Man was a fake. Whereas the Swanscombe skull
contained 1.7% of fluorine and 27 parts per million of uranium, the Piltdown
skull contained only 0.1% and 2 parts per million respectively, compared to
0.03% and zero parts per million for modern bone. Measurement of the
nitrogen content is also useful because this decreases with the gradual
disappearance of collagen. The nitrogen contents of the above were nil, 3.9% and 4.0%
respectively.

A novel method of telling old bones from modern fakes is by measurement of 
the velocity of sound in bone—this decreases with age. The velocity in bones 
500 years old is half that of modern bone and so the two are easily distinguished. 
The velocity falls only to one quarter of the present day value for 5000-year-old
bone so that precise dates are not easy.


6. ANALYTICAL TECHNIQUES 


Dating has a particular fascination for the layman and the older an object is 
shown to be the greater is his sense of awe. For the archaeologist, dates make 
up the basic framework which he clothes with the characteristics of the civiliza- 
tions he is unearthing. Although the framework is important it is not the end- 
product; in so far as it is definable, the end-product is an understanding of the way 
in which civilized society first developed and how successive civilizations and 
cultures have waxed and waned. Cultures are classified primarily according to 
the principal material from which their tools and weapons were made—Stone 
Age, Bronze Age and Iron Age. Secondarily, a distinctive object or style of 
object is used—e.g. basket-maker Indians, ‘ beaker ’ folk of the New Stone Age. 
Having exhausted classification in terms of visual characteristics, further sub- 
division is possible according to chemical composition, for instance, and the 
archaeologist has sought the co-operation of the analytical chemist for many years. 
Chemical analysis—particularly of trace impurities—by means of techniques 
from physics has developed enormously in the last decade. Some of these 




techniques are non-destructive or only slightly damaging to the specimen, and 
so are particularly apposite to archaeological research. 

The first three techniques to be mentioned—optical emission spectroscopy, 
X-ray fluorescent spectrometry, and neutron activation analysis—involve 
identification of an impurity element by detection of electromagnetic radiation of 
a characteristic wavelength that has been emitted, after suitable excitation, by 
the outer electron shells, the inner electron shells and the nucleus respectively. 
In principle all methods of analysis are applicable to archaeological research 
and it is a question of choosing the method which best fits the problem. 


Optical emission spectroscopy 

In this well-established technique about ten milligrams of the specimen are 
introduced into an electric arc or spark. Excitation and ionization of the outer 
electron shells causes the emission of visible light of sharply defined wavelengths. 
The atoms of a given element emit wavelengths which are characteristic of that 
element and the intensity of those wavelengths can be used as a measure of the 
percentage of that element present. The light emitted from the spark is split 
up into its constituent wavelengths by a quartz prism and detected by a photo- 
graphic plate. The degree of blackening corresponding to each wavelength is 
measured with a microphotometer and the concentrations of the various elements 
in the specimen are deduced by comparison with standards of known composition. 
Concentrations down to a few parts per million can be determined. 

This technique has been applied on a large scale to ancient copper and bronze 
implements by Junghans and Sangmeister (1957). By visiting museums through- 
out Western Europe several thousand samples (obtained by drilling a tiny hole in 
the implement concerned) have been obtained. Complex statistical methods 
have been used to divide the analyses obtained into twelve groups, each of 
which has a characteristic composition. These groups appear to originate from 
different geographical locations, and by linking this with the archaeological 
context in which an implement is found useful deductions about the spread of 
cultures in the Early Bronze Age can be obtained. 

The method is not generally applicable to later periods of the Bronze Age 
because of the practice of re-using scrap metal. Nevertheless by analysing 500 
implements and weapons of the British Middle and Late Bronze Age, it has been 
shown (Brown et al. 1959) that the deliberate addition of lead (frequently 4-7%) 
characterizes metal of the Late Bronze Age—either to facilitate casting and 
working or for economy in tin or copper. 

Another major application of optical emission spectrography has been to the 
baked-clay of Roman pottery (Richards and Hartley 1960). This was initiated
by analysing samples scraped from mortaria—distinctive bowl-like vessels having 
the inner surface studded with grit for the purpose of pounding or grinding 
food—on the bases of which it was customary for the potters to stamp their 
names. It was found that the composition of the clay used by a given potter 
was fairly consistent and that there was correlation between potters known to 
be working in the same area—as might be expected since it would be uneconomic 
to transport clay. Extending the analysis to colour-coated (or Castor) ware,
characteristic compositions have been established for different pottery producing
centres in Britain. (Thus, for Colchester: Na2O, 0.4%; MgO, 1.6%; MnO, 0.30%;
TiO2, 1.3%; CaO, 2.2%; FeO, 7.3%; K2O, 3.7%. Whereas for Great Casterton
in Lincolnshire: Na2O, 0.16%; MgO, 0.42%; MnO, 0.01%; TiO2, 2.4%;
CaO, 0.75%; FeO, 3.5%; K2O, 2.1%.) While considerable variation from the
average is to be expected, the differences for some oxides (e.g. CaO in the examples
quoted) are sufficiently striking to allow reliable attribution of a fragment of pottery
to one of several possible sources.
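
Attribution of a fragment to a source can be sketched as a nearest-match calculation. The reference figures are the oxide percentages for Colchester and Great Casterton as read from the text (a subset of five oxides whose values are legible); the fragment composition is hypothetical, and the choice of distance measure (differences of logarithms, so that each oxide counts by its ratio rather than its absolute amount) is an assumption made for the illustration, not the authors' statistical method.

```python
import math

# Oxide percentages as read from the text (subset of five oxides).
REFERENCE = {
    "Colchester": {"Na2O": 0.4, "MgO": 1.6, "CaO": 2.2, "FeO": 7.3, "K2O": 3.7},
    "Great Casterton": {"Na2O": 0.16, "MgO": 0.42, "CaO": 0.75, "FeO": 3.5, "K2O": 2.1},
}


def log_distance(sample, reference):
    """Root-sum-square of the log ratios between sample and reference oxides."""
    return math.sqrt(sum(math.log(sample[oxide] / reference[oxide]) ** 2
                         for oxide in reference))


def attribute(sample):
    """Name of the reference centre whose composition lies closest to the sample."""
    return min(REFERENCE, key=lambda centre: log_distance(sample, REFERENCE[centre]))


# Hypothetical analysis of an unattributed sherd.
fragment = {"Na2O": 0.2, "MgO": 0.5, "CaO": 0.9, "FeO": 4.0, "K2O": 2.3}
print(attribute(fragment))
```

In practice the scatter of each oxide about the average for a centre would have to be taken into account before any attribution was accepted.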

On a larger scale it should be possible to build up a general picture of the 
distributive organisation of the pottery trade in Roman Britain. For example, 
by analysing a selection of pottery found in a fort on Hadrian’s wall it is possible 
to find which pottery centres were the principal suppliers to that section of the
Roman Army. Further afield it is hoped to bring additional evidence on the
Palmer-Evans controversy over the decline of Knossos. If it is possible, by 
analysis, to distinguish pottery produced in Knossos from that produced on the 
Greek mainland, then examination of the ware exported around the Mediter- 
ranean should determine whether during the period 1400-1200 B.c. Knossos 
was an active exporting centre as Professor Palmer maintains or whether it had 
declined by then to a squatter civilisation as Sir Arthur Evans deduced by direct 
archaeological observation. 


X-Ray Fluorescent Analysis 

The x-rays emitted when the inner electron shells are excited (by irradiation
with the continuous spectrum from a 50 kV x-ray tube for example) lie in the
wavelength range 0.2 to 20 Å (1 Å = 10⁻⁸ cm) and may be split into constituent
wavelengths by means of a diffraction crystal. In quartz, for example, the
spacing between the planes of the crystal lattice is 3.34 Å and appreciable reflection


[Figure 2: schematic with labelled x-ray tube, collimator and detector.]


Fig. 2. Layout of x-ray fluorescent spectrometer. (Reproduced from Physics and 
Archaeology by courtesy of Interscience Publishers Ltd). 


from the face of the crystal only occurs when the angle of incidence is such that
the extra path travelled by x-rays reflected from successively deeper-lying planes
is a whole number of wavelengths. The reflected x-rays can be detected by a
photographic plate, or more conveniently by a Geiger counter, gas-proportional
counter, or scintillation counter. The crystal and detector are orientated (by
suitable mechanical gears) so that both the collimator and detector always make
the same angle θ with the crystal face. The concentration of a given element is
determined by measuring the output of the detector when it is at a value of θ
that corresponds to an x-ray wavelength of that element. As with the optical
spectrometer, calibration with standards closely resembling the specimen is
necessary.
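
The reflection condition described above is Bragg's law, nλ = 2d sin θ, with d the spacing of the lattice planes and θ the angle between the beam and the crystal face. A minimal sketch, using the quartz spacing of 3.34 Å given in the text; the K-alpha wavelengths for cobalt and manganese are approximate values supplied for the illustration and do not come from the article.

```python
import math

QUARTZ_SPACING_ANGSTROM = 3.34   # lattice-plane spacing quoted in the text


def bragg_angle_degrees(wavelength_angstrom, order=1, spacing=QUARTZ_SPACING_ANGSTROM):
    """Angle theta (degrees) satisfying n * wavelength = 2 * d * sin(theta).

    Returns None when no reflection of that order is possible.
    """
    sine = order * wavelength_angstrom / (2.0 * spacing)
    if sine > 1.0:
        return None
    return math.degrees(math.asin(sine))


# Approximate K-alpha wavelengths in angstroms (assumed values, not from the text).
LINES = {"cobalt": 1.79, "manganese": 2.10}
for element, wavelength in LINES.items():
    angle = bragg_angle_degrees(wavelength)
    print(f"{element}: theta is about {angle:.1f} degrees")
```

The two lines are separated by nearly three degrees in θ, which is why a mechanically scanned crystal can distinguish neighbouring elements such as these.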

Because X-rays are strongly absorbed by matter the analysis relates only to a 
thin surface layer of the specimen—roughly 0-01 to 0-1 mm thick. Consequently 
it is very suitable for the analysis of the glaze on pottery. The classical example 
of this type of application is the work of Dr. Stuart Young on Chinese blue- 
and-white porcelain, using the automated x-ray spectrometer developed by 
Dr. E. T. Hall at the Oxford Archaeological Research Laboratory. This 
application illustrates two particular advantages of x-ray fluorescent analysis—its 
non-destructiveness, which permitted examination of valuable porcelain, and 
its rapidity, which enabled some eighty pieces to be dealt with in half as many 
hours. 


[Figure 3: manganese/cobalt ratio plotted against date (A.D. 1300 to 1900), with separate symbols for Chinese and Persian pieces.]


Fig. 3. Analysis of Chinese blue-and-white porcelain. Native cobalt ore in China is 
rich in manganese and consequently a low manganese/cobalt ratio indicates the use 
of imported pigment. (Reproduced from Physics and Archaeology by courtesy of 
Interscience Publishers Ltd). 


Chinese blue-and-white porcelain was made by pre-Ming, Ming and Ching 
potters over a period of several centuries. Variations in the quality of the 
colour between contemporary pieces had been ascribed to the use of imported 
cobalt ores for the more refined pieces, and the use of native cobalt ores for the 
less refined ones. The cobalt ores occurring in China contain, without exception,
a high proportion of manganese, while those of Persia (the presumed source
of importation) and the Middle East are manganese-free. The hypothesis
that imported pigment was responsible for high quality was tested by measuring
the ratio of manganese to cobalt in an area of blue decoration of each specimen.
There was no correlation between high quality and low manganese, so that the
hypothesis was disproved. However it was discovered that the use of imported
ore is chronologically significant, its use ceasing by the end of the sixteenth
century and native pigment not being used until the beginning of the fifteenth.
The results shown in fig. 3 suggest incidentally that the authenticity of
blue-and-white porcelain ascribed to pre-1400 and post-1600 can be checked in this way.


Neutron Activation Analysis 

The analysis obtained by x-ray fluorescence refers only to a thin layer so 
that, although the technique is particularly suited to the examination of glazes, 
it can give misleading results when applied to coins. This is because corrosion,
surface enrichment of impurities, and deliberate plating can make the surface
layer different in composition from that of the coin as a whole. Neutron activation
analysis on the other hand is ideally suited to coin analyses, for both the neutrons
used as exciting agent and the characteristic gamma-rays subsequently emitted
are only weakly absorbed in matter. The main drawback is the need for a
nuclear reactor to provide the intense neutron flux (about 10¹² neutrons/cm² per
sec) that is necessary in order to obtain any useful sensitivity.

Large-scale application of this technique by Miss Vera Emeleus has made
an important contribution to knowledge possessed by economic historians and 
numismatists about the silver coinage of ancient Greece. About 600 coins 
have now been analysed—obviously only possible with a method that is entirely 
non-destructive. The procedure is to insert a can, containing about a dozen 
coins, into one of the irradiation channels of the reactor BEPO at Harwell. On 
withdrawal (after a time that may be as short as five minutes or as long as five 
weeks according to the half-lives of the impurities of interest) the coins are 
weakly radioactive, and the emitted radiation is studied with a gamma-ray 
spectrometer. A conventional sodium iodide scintillation crystal and photo- 
multiplier is used. 

The two principal impurities in the Greek silver coins were gold and copper. 
The nuclear reactions concerned in their detection are: 

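$$^{197}\mathrm{Au} + n \rightarrow {}^{198}\mathrm{Au} \xrightarrow{\;2.7\ \text{days}\;} {}^{198}\mathrm{Hg}$$

$$^{63}\mathrm{Cu} + n \rightarrow {}^{64}\mathrm{Cu} \xrightarrow{\;12.8\ \text{hours}\;} {}^{64}\mathrm{Ni}$$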


The formation of stable 198Hg is accompanied by the emission of a 0.41 MeV
gamma-ray and of 64Ni by a 0.511 MeV gamma-ray. The detection of these
energies of gamma-ray shows the presence in the coin of gold and copper 
respectively and the intensities are proportional to the concentrations. 
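
The choice of irradiation time, from five minutes to five weeks, reflects how quickly each activity builds up towards its limiting (saturation) value. A minimal sketch, assuming a constant neutron flux and the half-lives of 2.7 days for gold-198 and 12.8 hours for copper-64; the function names are illustrative.

```python
HALF_LIFE_HOURS = {"198Au": 2.7 * 24.0, "64Cu": 12.8}   # half-lives as given in the text


def saturation_fraction(irradiation_hours, half_life_hours):
    """Fraction of the limiting activity reached after a given irradiation time."""
    return 1.0 - 0.5 ** (irradiation_hours / half_life_hours)


def remaining_fraction(cooling_hours, half_life_hours):
    """Fraction of the activity left some time after withdrawal from the reactor."""
    return 0.5 ** (cooling_hours / half_life_hours)


FIVE_MINUTES = 5.0 / 60.0
FIVE_WEEKS = 5 * 7 * 24.0
for isotope, half_life in HALF_LIFE_HOURS.items():
    short = saturation_fraction(FIVE_MINUTES, half_life)
    long_ = saturation_fraction(FIVE_WEEKS, half_life)
    print(f"{isotope}: {short:.4f} of saturation after 5 min, {long_:.4f} after 5 weeks")
```

A short irradiation favours the shorter-lived copper activity relative to gold, while irradiating for much longer than a few half-lives brings no further gain for either.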

The results are interpreted on the basis that the gold concentration is deter- 
mined by the ore from which the silver was extracted and the copper is evidence 
of the smelter’s skill in refining or of the treasury’s wile in deliberate alloying. 
For although traces of gold could not be separated from silver by the ancient
Greeks, most of the copper would be incidentally removed by the process of
cupellation (heating in a blast of air) which was used to get rid of the lead.
Athenian silver occurred in galena, which is still mined today, for the lead, at
Lavrium, where some of the ancient apparatus can be seen.

When the programme of analyses was started the principal results hoped 
for concerned the source of the ore from which the various Greek cities extracted 
their silver. For instance, in the fifth century B.C., did Corinth, with no
local source of silver, obtain it entirely from Athens or did the varying degrees 
of enmity between the two cities prevent this? Figure 4 shows the evidence 
obtained. This strongly suggests that Corinth obtained some but by no means all
her silver from Athens. It should be mentioned here that of all the silver coinages
studied Athens and Macedon alone show the distinctively low gold content of
less than 0.1% and the latter state is an unlikely source for Corinthian silver on
historical grounds.

Fig. 4. Neutron Activation Analysis: gold and copper contents of silver coins of (a) Athens
and (b) Corinth in the 5th century B.C. (Each panel plots copper content, %, against gold
content, %, on logarithmic scales.)

In addition to the expected information about sources of ore, some rather
striking and unexpected light was thrown on coinage manipulations. While no
other cities could rival the refining skill of the master-smelters at Lavrium in
reducing the copper to below 1%, it is evident from the analyses that their
techniques were capable of reaching 1 or 2%. For example, of 50 silver
tetradrachms minted at Syracuse during the fifth century B.C., those of the first
quarter contained little more than 1% copper. However this figure increased
steadily to nearly 5% during the second quarter, but fell back to a predominance
of coins having less than 1% copper in the middle of the century. In the
third quarter the copper content again increased and fluctuated between 1%
and 2% for the remainder of the century. These are interpreted as deliberate
fluctuations, either to increase durability or to save silver. The debasement in
the second quarter could reflect financial difficulties of the Syracusan tyrants
before their expulsion in 461 B.C. At Macedon too, there is clear evidence of
deliberate debasement. Throughout the fifth century, through the reign of
three kings, a certain denomination, the tetrobol, was issued in two forms.
On one the horse on the obverse bears a rider, on the other there is no rider.
The copper content of the ‘with rider’ type varied between 0.02 and 0.2% and
bears witness to the ability of the Macedonian smelters throughout the century.
The ‘without rider’ coins on the other hand all contain between 5 and 20% of
copper. Evidently the Macedonian kings realized the profit to be gained from
a token coinage long before nineteenth century economists.


Fig. 5. Analysis by beta-ray back-scattering. The diagram shows the radioisotope in its
container, the primary β-rays, the scattered β-rays and the Geiger counter. (Reproduced
from Physics and Archaeology by courtesy of Interscience Publishers Ltd.)


Beta-ray back scattering 


When beta rays are incident on a surface, some are absorbed but others 
emerge again. This ‘back-scattering’ is strongly dependent on the atomic 
numbers (Z) of the constituent elements, because the scattering arises from 
deflection of the electrons by the charge carried on the nucleus. The apparatus
required to measure the relative back-scattering coefficient of a surface is
comparatively simple and cheap to construct (about £30). A few microcuries of
a suitable radioisotope, e.g. thallium-204 (half-life several years, beta-ray energy
0.76 Mev), provide the beta rays, and a Geiger counter detects the back-scattered
intensity. The voltage for the Geiger is provided by a transistorized power
supply and the counts per minute received by the Geiger are indicated by a
transistorized ratemeter built into the same box.


Fig. 6. The beta-ray back-scatter meter.


The most useful application is in detecting the lead (Z = 82) content of glass
and glazes. The instrument is sensitive to the presence of 5% or more of lead
and it can be calibrated by reference to standards. The question to be
answered is usually whether lead has been added deliberately, in which case the
lead content is well above 10%, or whether the glass is of a non-lead type, lead
being entirely absent or present only as a trace impurity. Consequently the
poor sensitivity is unimportant and more than compensated for by the portability
of the instrument, which makes it possible to establish whether a glass or
glaze is lead or non-lead without removing the specimen from the museum shelf.
The instrument does not of course distinguish between lead and other high
atomic number elements. Of these the most likely in glazes is tin (Z=50), and
its presence can usually be inferred on other grounds.

Besides the determination of lead content, the technique has been used in 
Japan to determine the thickness of gilt on a statue of Buddha. Since the base 
material had a low atomic number the scattered intensity was dependent on the 
gilt thickness (Z=79 for gold) and calibration was obtained from gilt layers of
known thickness.




The American Association of Physics Teachers: 
Demonstration Experiments in Physics 


Contributions are now being solicited by the Demonstration Book Committee
of the American Association of Physics Teachers for a new text entitled
“A Reference Source for Demonstration Experiments in Physics.” The text
will consist of detailed descriptions of demonstration equipment and their use
in lectures, lists and sources of materials and diagrams to make construction by 
individuals possible. In addition there will be invited articles on selected 
subjects such as the role and purpose of lecture demonstrations, and the use of 
audio-visual techniques. 

Emphasis in the selection of material will be put on demonstrations related to funda- 
mental concepts. Some examples are: modern standards of length and time; frictionless 
pucks; kinetic theory; quantum effects, such as the Franck–Hertz experiment and the
diffraction of electrons; demonstration of the microscopic properties of solids, such as 
magnetic domains, dislocations, and the Hall effect; scattering and resonance phenomena; 
electric and magnetic field patterns; use of models e.g. of crystals, accelerators, and particle 
detectors ; wave phenomena; analogue techniques, e.g. microwave representation of X-ray 
diffraction; satellite tracking and the Doppler effect; the velocity of light; the conservation 
laws of energy, momentum and angular momentum. 


Support for the preparation of the book has been provided by the National 
Science Foundation. The contributions of all individuals will be recognized
in their articles and a complete list of all contributors will be included in an 
appendix. 

A brochure, including detailed information on all aspects of the preparation 
of material to be submitted for inclusion and containing sample demonstration 
contributions, can be obtained from Harry F. Meiners, A.A.P.T. Demonstration
Book Project, Science Center, Rensselaer Polytechnic Institute, Troy, New York.
Contemporary Physics has a supply of copies for prompt distribution in Britain;
they are available on application to the Editor at Uppingham.


The Molecular Approach to Biology 


by D. M. BLOW 
Molecular Biology Research Unit, Cavendish Laboratory, Cambridge 


SUMMARY 

Some of the essential activities of a living cell are considered. These 
include the storage and transfer of information, the control of synthetic 
mechanisms, and the fabrication of specific molecules to carry out various 
biochemical processes. These activities depend on two types of polymer, 
found in all living organisms, the nucleic acids and the proteins. Knowledge 
about the molecular structure of these polymers, and of their behaviour, 
enables us to make a rudimentary physical picture of the mechanisms 
involved. 


1. INTRODUCTION 

Physical science advances biological knowledge in many ways. The biologist 
can be given new physical tools (biological physics), and useful measurements 
can be made on living systems (physiology). But at a deeper level, we can now 
begin to describe some fundamental aspects of vital processes in terms of 
physical models on the molecular scale. The nature of this description will be 
indicated in the following pages. 

The nearest we have come, in modern technology, to fabricating a living cell 
is, in my opinion, in making plans for an automatic factory. An integrated
system which can take in raw materials from the surroundings, adapting its 
behaviour according to the supply position, and convert them into the required 
products, would begin to fulfil some functions which the cells of the most 
primitive organisms mastered. Indeed, if the product which the factory turned 
out were identical automatic factories, the robot age of science fiction might have 
begun. 

Taking then, as an engineering definition of a living cell, ‘A completely 
automatic factory for fabricating automatic factories like itself ’, we may profitably 
consider what components might be found in such a system. Passing over such
trivia as a power station for utilizing whatever energy source might be available, 
it is clear that a large computer would be the control mechanism at the centre 
of our design. In its store would be an encyclopaedia of programmes which 
would give the proper response to all possible sets of external circumstances, 
and these would be activated by input devices which would record the external 
conditions and the supply position. Other input channels would monitor the 
progress of the various factory processes, forming the feedback loops which are 
essential to control mechanisms. Output from the computer would go, perhaps 
via punched paper tape, to a set of automatic machine tools which would perform 
the various operations required for construction of a duplicate factory. Here 
the complex task of converting the information stored in the computer into 
solid matter would be performed. 

A crucial part of the construction of the duplicate factory will come in the 
copying of the encyclopaedia from the memory of the original computer into the 
newly built one. It might be thought necessary for this process to be thoroughly 
checked and re-checked, so that no error can arise. However, if we are willing
to throw caution and economy to the winds, a better result will be achieved by
allowing a small degree of unreliability. 

This is because the new factory, once made, will be competing with the old 
for supplies, and if it is a perfect copy they will compete on even terms. If there 
is an error in copying, it will probably be at a disadvantage, but after many trials 
there may arise, by chance, an improved version. In time, we may safely 
suppose that the supply position will become difficult. Inefficient factories will 
be torn down to provide more raw materials and only the most efficient factories 
will survive. 

2. THE HIGH POLYMERS OF MOLECULAR BIOLOGY

There is no need to take the analogy further. Instead of steel and copper, 
with traces of silicon and germanium, which we might have chosen for such a 
device, Nature uses organic materials, and has developed in particular three 
high polymers which carry out the most essential phases of her task. Unlike 
the high polymers which are synthesized industrially, these polymers are copoly- 
mers of definite sequence: although they are made by linking similar units into 
a chain, the units are not identical, and they are arranged in a specific order. 


Fig. 1. The four bases of DNA. A=adenine, T=thymine, G=guanine, C=cytosine.
A and G are purines; T and C are pyrimidines. The dotted lines indicate hydrogen 
bonds which make the pairing indicated energetically favourable. Only the two 
base-pairs shown occur in DNA. (After F. H. C. Crick and J. D. Watson, Proc. 
Roy. Soc., A, 223, 80, 1954.) 




There are two main types of polymer to be considered. One type is used for 
the storage and transfer of information, and for this purpose a polymer is required 
whose properties are virtually independent of the information written on it, 
just as one page of a book is very like another. Enough effect is required to 
enable the information to be ‘ read’, and no more. The polymers which are 
used for this are known as nucleic acids and their fundamental formula is 


    phosphate
        \
         sugar - base
        /
    phosphate
        \
         sugar - base
        /
    phosphate


The ‘ backbone’ of the nucleic acid consists of sugar rings linked as esters 
through phosphate groups. Two varieties of nucleic acid are found; in one the
sugar involved is ribose, and in the other, deoxyribose. The polymers are 
called, briefly, RNA (ribonucleic acid) and DNA (deoxyribonucleic acid). In 
each case, the information is written into the sequence of bases. At each point 
in the chain any one of four basic groups may appear. The bases which appear 
in DNA are illustrated in figure 1, and will be referred to briefly as A, T, G, and 
C. The two larger ones A and G are called purines; T and C are pyrimidines. 

A quite different type of polymer is needed for the fabric and working 
material of the cell. Here we require the largest possible diversity of properties 
consistent with the continuity of a single type of chain. The proteins are
naturally occurring examples of compounds called polypeptides, whose
fundamental formula is


    ... -NH-CH(side chain)-CO-NH-CH(side chain)-CO- ...
        |<-- one amino acid -->|


The repeating unit is an amino acid and in proteins twenty different types of 
amino acid are found. Some of the side chains are positively charged under 
physiological conditions, others negatively; some are hydrophobic, others 
hydrophilic; some can be donors or receivers in hydrogen bonds; some contain 
aromatic and other ring systems, others contain sulphur, and so on. 




3. THE STRUCTURE OF THE GENETIC MATERIAL 


It is not difficult to find in the cell a structure which corresponds to the 
encyclopaedic information store at the centre of the automatic factory. Virtually 
all cells contain a central nucleus which is packed full of DNA (combined with a 
special protein). In higher organisms, the DNA is further divided into a number 
of chromosomes, which under the microscope are seen to be very highly organized
structures, with a fascinating division cycle of their own. Many experiments
have shown that the nucleus is the site of the genetic identity of the cell.

Fig. 2. The structure proposed by Watson and Crick for DNA. The wires represent the
backbones of the two intertwining helices, the five-sided rings being the sugar
residues. Attached to these are the bases (shown as flat, dark plates) which link
the two chains together by hydrogen bonds (white areas). (F. H. C. Crick and J. D.
Watson, Proc. Roy. Soc., A, 223, 80, 1954.)

The four bases of DNA evidently act as a four-letter alphabet in which this
genetic encyclopaedia is written. Typical DNA molecules have molecular
weights of a few million, and contain 10⁴ or more letters of this alphabet. Within
one organism, the total DNA per cell seems to be constant; in man a haploid cell
nucleus contains about a metre of DNA, or about 6 × 10⁹ bases. Written in a
four-letter alphabet its information content is comparable with the Encyclopaedia
Britannica. In the embryonic tadpole, cells are dividing every hour, each
daughter cell taking with it one copy of the encyclopaedia, and it is clear that the
cell has a formidable printing job on hand.
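
The size of this printing job can be put in rough numerical terms. Each letter of a four-letter alphabet can carry at most log₂ 4 = 2 bits, so the figure for man, taken here as 6 × 10⁹ bases, sets an upper limit on the information stored. The sketch below is simple arithmetic under that assumption; it ignores the redundancy between the two complementary strands and any redundancy within the sequence itself.

```python
import math

BASES_PER_HAPLOID_NUCLEUS = 6e9  # figure for man, as assumed in the lead-in
ALPHABET_SIZE = 4                # the four bases A, T, G and C

# Maximum information carried by one letter of a four-letter alphabet.
bits_per_base = math.log2(ALPHABET_SIZE)

# Upper limit for the whole store, in bits.
total_bits = BASES_PER_HAPLOID_NUCLEUS * bits_per_base

print(f"{bits_per_base:.0f} bits per base, at most {total_bits:.1e} bits in all")
```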

The clue to the copying mechanism came from an x-ray diffraction study of 
fibres prepared from purified DNA. In 1952, the full details of diffraction by a
regular, discontinuous helix, such as a helical arrangement of atoms, were 
worked out for the first time. Crick and Watson realized that the x-ray diffrac- 
tion pattern of DNA provided a clear example of diffraction from a helix. 
However, the adjustment of the structure so as to fit the observed diffraction 
pattern caused great difficulty. A very compressed arrangement of atoms was 
needed to fit the required chemical structure into a helix of the kind indicated. 
Crick and Watson found that it could only be done in one way. The DNA 
molecule was not composed of one polymeric strand, but of two, twisted together 
like the strands of a rope. The bases were stacked flat between them, like the
treads of a spiral staircase. The two ‘ backbones’ of sugar-phosphate groups 
were too close together to accommodate two of the large purine bases between 
them, but could just accommodate one purine and one pyrimidine. A very neat 
arrangement was found in which a purine and a pyrimidine could be linked 
tightly by hydrogen bonds, but in this arrangement A could link only with T, G 
only with C (figure 1). The sequence of bases in the two chains is complemen- 
tary, so that wherever A appears on one strand T appears on the other, and so 
on. This arrangement requires that the total amounts of purine and pyrimidine 
in a DNA molecule are equal, and this is confirmed by chemical analysis. The 
proposed structure (illustrated in figure 2), gave excellent agreement with the 
observed diffraction pattern. 
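
The pairing rule just described (A only with T, G only with C) can be written out as a short sketch. It shows how one strand determines the other, and why the duplex must contain equal amounts of purine and pyrimidine. The function names and the example sequence are ours, chosen only for illustration, and the strands are compared position by position without regard to chain direction.

```python
# Pairing rule of the double helix: A with T, G with C.
PAIRING = {"A": "T", "T": "A", "G": "C", "C": "G"}
PURINES = {"A", "G"}  # the two larger bases; T and C are pyrimidines


def complementary_strand(strand: str) -> str:
    """Return the base sequence that pairs with `strand`, position by position."""
    return "".join(PAIRING[base] for base in strand)


def purine_pyrimidine_counts(strand: str) -> tuple[int, int]:
    """Count purines and pyrimidines in the two-stranded molecule built on `strand`."""
    duplex = strand + complementary_strand(strand)
    purines = sum(base in PURINES for base in duplex)
    return purines, len(duplex) - purines


example = "ATGGCATTC"  # arbitrary illustrative sequence
print(complementary_strand(example))
print(purine_pyrimidine_counts(example))
```

Because every purine faces a pyrimidine, the two counts are equal whatever sequence is chosen, although a single strand taken alone need not show this equality.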


4. THE DNA COPYING MECHANISM 

The structure gave a very suggestive clue about the way in which the copying 
mechanism worked during cell division. The two strands are only weakly 
linked by hydrogen bonds, but the sequence of bases in one strand is completely 
determined by the complementary strand. If, on cell division, one strand went 
to each daughter cell, it seemed reasonable that each could then reform a 
complete, double-stranded molecule again. 

That the DNA of a cell is equally divided between its progeny has been 
proved in a number of ways, but most clearly in an experiment by Meselson and 
Stahl. They had recently perfected a technique for separating large molecules 
of slightly different density by sedimentation equilibrium. A solution (usually 
of caesium chloride) is made up to exactly the required density, and is then 
subjected to a centrifugal field of the order of 10⁵ g in an ultracentrifuge. This
field is sufficient to force the heavy caesium ions significantly outwards, and the
result is to set up a slight density gradient in the solution. Any large molecules
in the solution will be forced towards a level corresponding to their own density;
moreover the mass of these large molecules makes their mean square thermal 
velocity very small, and once they have reached the correct level the tendency to 
diffuse away is slight. Thus a rather sharp band is formed, containing all large 
molecules of the same density. 

In Meselson and Stahl’s experiments, bacteria were grown in a medium where
a large proportion of the available nitrogen was N¹⁵. They were then transferred
to a medium containing N¹⁴, so that newly synthesized DNA would contain this
isotope. DNA was prepared from the first few generations of daughter cells, as
well as from the original N¹⁵ stock. Each DNA preparation was then brought
to sedimentation equilibrium in a density gradient, with the results shown in
figure 3. The first generation DNA is homogeneous, but has a slightly lower
density than the original ‘heavy’ stock DNA. About half the second generation
had the same density, while the remainder had even less. In subsequent
generations, the proportion of lowest density DNA, containing only N¹⁴,
increased. This is exactly what would have been predicted by the hypothesis
that the strands separate at each cell division and each is used to make a
complete new molecule. It is worth noting that the total change in density
for the DNA is less than one per cent, indicating the sensitivity of the
sedimentation equilibrium method.
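
The prediction of the strand-separation hypothesis can be followed generation by generation with a small bookkeeping sketch. It is an idealized model, not a description of the experimental procedure: every molecule is assumed to be a pair of strands, each wholly heavy (N¹⁵) or wholly light (N¹⁴), and every new strand is assumed to be made from light nitrogen. The names used are ours.

```python
from collections import Counter
from fractions import Fraction

HEAVY, LIGHT = "N15", "N14"


def replicate(duplexes):
    """One round of replication in light medium.

    Each duplex separates into its two strands, and every old strand
    is paired with a newly made light strand.
    """
    return [(strand, LIGHT) for duplex in duplexes for strand in duplex]


def density_classes(duplexes):
    """Fraction of duplexes that are heavy, hybrid or light."""
    names = {0: "light", 1: "hybrid", 2: "heavy"}
    counts = Counter(names[duplex.count(HEAVY)] for duplex in duplexes)
    return {
        name: Fraction(counts[name], len(duplexes))
        for name in ("heavy", "hybrid", "light")
    }


# Start from a single fully heavy molecule and follow three generations.
population = [(HEAVY, HEAVY)]
for generation in range(4):
    print(generation, density_classes(population))
    population = replicate(population)
```

In this model the first generation is entirely of intermediate density, the second is half intermediate and half light, and the light fraction grows thereafter, which is the pattern described above.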

An enzyme was discovered by Kornberg which takes our knowledge of the 
process one step further. In the presence of this enzyme, and with a supply of 
suitable raw materials, a single-stranded DNA spontaneously ‘ grows’ a new, 
complementary chain. This has so far been proved only for simple synthetic
polymers which are chemically like DNA, but there is every reason to suppose 
that the enzyme would work equally well for a general base sequence. 

It has been shown in Doty’s laboratory at Harvard that two complementary 
DNA chains are capable of pairing spontaneously if given sufficient time to 
‘search’ for the proper method of joining up. The experimenters studied the
optical rotatory properties of DNA solutions at increasing temperatures, and
found that there is a temperature (the ‘melting out’ temperature) where the
optical rotation changes abruptly, indicating that the regular, helical coiling has
broken up, and the two chains have come apart. The transition is sharp because
the DNA molecules are large and the ‘melting’ is a cooperative effect, just as
in a melting crystal. If the mixture is quickly cooled below this temperature,
bases pair at random, without change of optical rotation. If, however, the
solution is taken through the melting-out temperature very slowly, the optical
rotation returns to that of the original, helical DNA solution. These experiments
show that the energy of the hydrogen bonds involved in the specific base-pairing 
scheme is sufficient to stabilize the double-helix form of DNA, in preference 
to other possible forms, under conditions similar to those in the living cell. 

In this way we can form a rudimentary, but coherent, picture of the way in 
which the pool of genetic information, a kind of encyclopaedia of racial lore, is 
handed down to every nascent cell. This information, comprising ‘the genes’,
is written in the 4-letter alphabet of DNA, and Kornberg’s enzyme is probably 
used to make a copy at each cell division. It is easy to envisage ways in which 
mutation can arise, by ‘ mistakes’ in the copying mechanism; and this has to 
some extent been demonstrated by comparing the genetic effects of different 
kinds of chemical mutagen. 


Fig. 3. Results and interpretation of Meselson and Stahl’s experiment. On the left are
ultraviolet absorption photographs of the ultracentrifuge cell, after sedimentation
equilibrium has been established. They show clear separation between the DNA
of various densities. The interpretation of the results is given on the right. The
original DNA contains many heavy nuclei, indicated by a cross. During each
generation the DNA splits into two separate chains, and a new complementary DNA
is attached, containing only stable (light) nuclei. (M. Meselson and F. W. Stahl,
Proc. Nat. Acad. Sci., 44, 671, 1958.) (Rows, from the top: parent DNA; 1st generation
DNA; 2nd generation DNA.)


5. THE GENETIC MAP 


Geneticists determined that the hereditary information exists in linear form 
without reference to a physical model of any kind. When offspring arise from 
more than one parent, as in sexual reproduction, it is found that the character- 
istics inherited from one parent are not randomly distributed. If a certain trait 
A is inherited from one parent, then another trait B of the same parent may have 
more than a random probability of inheritance. If the probability is found to 
be high, A and B are said to be ‘ closely linked’. These experiments are scarcely 
practicable for humans, but readily applied to microorganisms, with a much 
simpler genetic pattern, and which produce much larger families in a much 
shorter time. In these cases, it is found possible to construct a ‘ linkage map’ 
in which genetic traits are spaced according to the closeness of their linkage. 
A consistent linkage map can always be made in which the traits are arranged 
along a single line. 
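
The construction of a linear map from pairwise linkage data can be sketched as follows. This is an idealized illustration only: it assumes that the closeness of linkage has already been turned into map distances that add up exactly along a line, which real data do only approximately. The trait names and distances are invented for the example, and the function names are ours.

```python
from itertools import combinations


def make_lookup(distance):
    """Return a function giving the map distance between any two traits."""
    def lookup(a, b):
        if a == b:
            return 0.0
        return distance[(a, b)] if (a, b) in distance else distance[(b, a)]
    return lookup


def linear_order(distance):
    """Arrange the traits along a line.

    The two most widely separated traits are taken as the ends, and the
    others are placed according to their distance from one end.
    """
    traits = sorted({trait for pair in distance for trait in pair})
    d = make_lookup(distance)
    left, _right = max(combinations(traits, 2), key=lambda pair: d(*pair))
    return sorted(traits, key=lambda trait: d(left, trait))


def is_additive(order, distance, tolerance=1e-9):
    """Check that every pairwise distance agrees with positions on the line."""
    d = make_lookup(distance)
    position = {trait: d(order[0], trait) for trait in order}
    return all(
        abs(abs(position[a] - position[b]) - d(a, b)) <= tolerance
        for a, b in combinations(order, 2)
    )


# Hypothetical distances in arbitrary map units, invented for illustration.
example = {("A", "B"): 3.0, ("A", "C"): 10.0, ("A", "D"): 7.0,
           ("B", "C"): 7.0, ("B", "D"): 4.0, ("C", "D"): 3.0}
order = linear_order(example)
print(order, is_additive(order, example))
```

If the distances could not be reconciled with any arrangement on a single line, the additivity check would fail; the statement above is that for genetic linkage data a consistent linear arrangement can always be found.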

Benzer, in studying a single genetic locus of a bacterial virus, has shown the 
extraordinary degree of refinement possible in this genetic mapping. Mutation 
at this locus prevents the virus from infecting certain types of bacteria. Benzer 
has identified 275 distinct mutations at this locus, and can show, by statistical 
analysis of their occurrence, that at least another 200 possible mutations have 
not yet been found. From the linkage map, all 275 can be arranged into a linear 
order. Assuming that the linear sequence of the linkage map corresponds to 
the linear arrangement of DNA, the total number of mutation sites is at least one 
tenth of the number of base pairs in this section of DNA. The picture of 
mutation as an alteration in a single base-pair of the DNA begins to look
reasonable.


6. WHAT DO THE GENES DO? 


Beadle, in 1945, suggested that the genetically transmitted information served
to define the structure of the proteins made in the cell, and especially of the
enzymes, which act as biochemical catalysts. He wrote: “In determining the
specific chemical, and perhaps physical configuration of protein molecules, genes
directly determine enzyme specificities, and thereby control in a primary way
enzymatic synthesis and other chemical reactions in the organism.” This
theory, which became known as the ‘one gene, one enzyme’ theory, has been
widely accepted. The gene was defined by Beadle as the “irreducible unit of
genetic transfer”, but Benzer’s experiments show that, like the atom, the
irreducible gene has turned out to have a fine structure of its own.

To study this theory, one needs to find aberrant protein molecules which are 
produced by a single genetic defect. Strangely enough, some of the best 
substantiated examples occur in the human species, and amongst the most 
thoroughly studied is that of the sickle cell anaemia disease prevalent in many 
tropical countries. The sickled shape of the red blood cell is due to reduced
solubility of the haemoglobin it contains, which comes out of solution into a 
paracrystalline form. The disease is transmitted like a simple Mendelian trait. 
In its heterozygous (mixed gene) form, it appears to confer some immunity from 
malaria, and this evidently accounts for its prevalence. Pauling showed that 
sickle cell haemoglobin migrates differently from normal haemoglobin in an 
electric field, a difference which can be ascribed to the presence of one extra 
positive charge. An ingenious method devised by Ingram allowed a more
detailed interpretation. Controlled enzymatic digestion reduced the protein to
a number of small peptides (containing of the order of ten amino acids), in a
reproducible way. When these were separated by electrophoresis and chroma- 
tography, it was found that normal and sickle-cell haemoglobins gave exactly the 
same pattern, with the exception of a single peptide, which carried one positive 
charge in sickle cell haemoglobin and was uncharged in the normal protein. 
Chemical analysis showed that these peptides differed only by a single amino 
acid substitution. 

Similar studies have now been completed on several other abnormal haemo- 
globins, and in each case it seems that a simple Mendelian trait causes just one 
amino acid change. Beadle’s theory can now be stated in the form ‘ one genetic 
site, one amino acid’. The same kinds of changes are found between analogous 
proteins of related species, and when sufficient evidence accumulates a new 
natural history will have to be written, based on chemical comparison of proteins 
instead of on differences of external form. 

It would clearly be of the greatest interest to compare the amino acid sequences 
of mutant proteins made by a series of closely linked mutations of the kind studied 
by Benzer. In this way it should be possible to show whether the linear sequence 
of the genetic map corresponds to the linear sequence of amino acids in the 
protein, and perhaps also to test various hypotheses about the DNA code. While 
genetic mapping is easiest with the microorganisms, it is correspondingly 
difficult to prepare and study their proteins in quantity. So although this 
problem is being attacked by several groups, results are still awaited. 


7. THE MECHANISM OF PROTEIN SYNTHESIS


The molecular mechanism by which this genetic scheme controls protein 
synthesis is still little understood. However, recent studies reveal fascinating 
clues. It is certain that the scheme depends on RNA (see p. 179) and on particles 
of RNA and protein outside the nucleus known as ribosomes. These are the 
actual site of protein synthesis, the machine tools. One complication is that 
many different types of RNA exist, with different functions, which are not 
readily separated. 

It seems that some of the RNA serves as a message, copied from the encyclo- 
paedia, giving instructions to the protein-synthesizing machinery in the ribosome. 
The evidence suggests that a given ribosome is able to synthesize any protein 
molecule. If we consider it as an automatic machine tool, the programme tape 
for its next job consists of a length of RNA, whose sequence has been copied 
from the DNA in the nucleus. The copying probably takes place by a mechan- 
ism very similar to that which reproduces DNA, but under rigid control, using 
feedback devices to relate production to current needs. 

It has been known for many years that quite simple chemical compounds
can stimulate a cell to begin rapid synthesis of an enzyme. Thus the enzyme
β-galactosidase, which breaks down β-galactoside sugars in the bacterium E. coli,
can be produced in huge quantities, but only when substances like β-galactosides
appear in its environment. The hypothesis is that there are only certain points
on the DNA chains where copying of an RNA message can begin. The starting
points might be likened to the beginnings of separate entries in the encyclopaedia,
and the entry must describe all the mechanisms needed for dealing with a
particular situation, such as the appearance of β-galactosides. Until the
particular entry is needed, the starting point is somehow masked, and message
RNA cannot be dispatched to the ribosome until the starting point is made active.
The working of this mechanism is unknown: what is known is that there are
regions of the genetic map which have all the properties one would expect of
this kind of starting point. (1) They are adjacent to, and at one end of, that part
of the map dealing with synthesis of a particular protein; (2) mutations which
occur in this region of the map do not affect the properties of this protein; (3)
they do, however, cause aberration in the control of production of the protein.

Very little is known about the mechanism by which the nucleic acid code is 
translated, although there is one significant clue. Another fraction of RNA, 
a low molecular weight form, is found to consist of a short sequence of RNA 
terminated by a single amino acid residue. It is not impossible that a matching 
with a part of the RNA sequence on the ‘messenger’ could result in the amino
acid being placed correctly for incorporation into a nascent protein molecule. 
It may thus act as the cypher used for decoding the message inscribed in the 
nucleic acid sequence. 

Direct evidence about the way the information provided by the RNA 
messenger is translated into protein is obtained in a remarkable experiment, 
just reported by Nirenberg and his colleagues. Ribosomes were supplied with 
amino acids, other necessary supplies, and a synthetic RNA in which all the 
bases were of one kind (polyuridylic acid). This system rapidly synthesized 
a polypeptide containing only one kind of amino acid, polyphenylalanine. The 
simplest interpretation is that the code for the amino acid phenylalanine in the 
RNA messenger language is a certain number† of uridine bases. If the scheme
is really so simple, variations of the experiment should be able to lead to a 
complete elucidation of the coding scheme. 
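
The reading of the poly-U experiment can be put as a toy decoder. This is only a sketch: the group length of three is taken from the article's own footnote, the single table entry is the one suggested by the experiment, and the idea that the message is read in successive non-overlapping groups from a fixed start is an assumption made here for illustration. The names `CODON_LENGTH`, `KNOWN_CODE` and `translate` are hypothetical.

```python
CODON_LENGTH = 3  # group length, taken from the article's footnote

# Only the one assignment suggested by the poly-U experiment is filled in.
KNOWN_CODE = {"UUU": "Phe"}


def translate(rna):
    """Read the RNA in successive non-overlapping groups of CODON_LENGTH bases.

    Assumption: reading starts at the first base and never overlaps.
    Groups that are not in KNOWN_CODE are reported as "?".
    """
    peptide = []
    for start in range(0, len(rna) - CODON_LENGTH + 1, CODON_LENGTH):
        codon = rna[start:start + CODON_LENGTH]
        peptide.append(KNOWN_CODE.get(codon, "?"))
    return peptide


poly_u = "U" * 12  # a short stretch of synthetic polyuridylic acid
print(translate(poly_u))
```

Filling in further entries of `KNOWN_CODE` is exactly what the suggested variations of the experiment would have to supply.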

An ingenious experiment by Dintzis has shown that the assembly of amino
acids into protein is performed in a linear sequence. The red blood cells of
anaemic rabbits were used for this experiment because during anaemia the red
cell devotes all its capacity to the synthesis of one protein, haemoglobin.
Ribosomes from these cells were provided with all necessary supplies to allow
haemoglobin synthesis and were then given for a short period a supply of radio-
active amino acids. The newly synthesized protein was split up in the same
way as in Ingram’s experiments (p. 185), and the radioactivity of the different
peptides measured. The total radioactivity increased with the time radioactive
amino acids had been provided, but the peptides were not uniformly labelled.
The non-uniformity was exactly the kind expected if the assembly mechanism
had taken the peptides in a definite linear order. The order is thus shown to be
definite: if it can be shown that the order of labelling these peptides is the same
as their sequence in the intact haemoglobin, we shall have strong evidence for a
completely linear mechanism.‡
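
The kind of non-uniform labelling to be expected from a linear assembly can be illustrated with a very simple model. This is a sketch under stated assumptions, not a reconstruction of the actual analysis: chains are assumed to grow at a uniform rate from one end, the ribosomes are assumed to be spread evenly over all stages of growth when the label arrives, only chains completed during the labelling period are counted, and the labelling period is shorter than the time needed to build one chain. The function `label_profile` and its parameters are hypothetical.

```python
def label_profile(n_peptides, pulse, n_chains=1000):
    """Count labelled copies of each peptide among chains finished during the pulse.

    pulse    : labelling time, in units of the time needed to build one chain
               (assumed to lie between 0 and 1)
    peptides : numbered 0 .. n_peptides-1 in the order of assembly
    """
    counts = [0] * n_peptides
    for k in range(n_chains):
        # Fraction of this chain already built when the label arrives.
        start = (k + 0.5) / n_chains
        if start + pulse < 1.0:
            continue  # not finished during the pulse, so not in the sample
        for j in range(n_peptides):
            midpoint = (j + 0.5) / n_peptides
            if midpoint >= start:
                counts[j] += 1  # this peptide was built after the label arrived
    return counts


print(label_profile(10, 0.5))
```

In this model the peptides assembled last carry the most label and those assembled first carry none until the pulse is long enough, which is the graded pattern a definite linear order would produce.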


8. THE STRUCTURE OF PROTEINS


Now we may look in more detail at the end-products of this remarkable 
sequence of events: the proteins. If DNA is the encyclopaedia of the cell,
RNA the messenger and the cypher, and the ribosomes are the automatic
machine tools, the proteins provide almost everything else, from structural
support to intricate mechanisms for carrying out the most delicate biochemical
tasks.

†This number is now known to be three, from a beautiful experiment by Crick and
others (Nature 192, 1227 (1961)).

‡This has now been done.

The remarkable thing about protein molecules is that although they are 
linear in their chemical structures, their shape can be anything from fully 
stretched (as in silk) to an almost spherical bundle (as in, for instance, haemo- 
globin and most enzymes). The early studies established that proteins consisted 
of amino-acids, occasionally associated with other chemical structures, that the 
polypeptide chain which linked them was never branched, and that only about 
twenty different amino acids were normally found in proteins. But they also 
made it clear that the manner by which the single chain was coiled up into a 
solid structure of definite form would play a vital part in understanding their 
behaviour. This part of protein chemistry reached maturity when Sanger was 
able to give the complete chemical structure of insulin. Insulin is a peptide 
hormone, generally thought too small a molecule to be called a protein, containing 
51 amino acid residues. Similar studies on an even grander scale have led to 
amino acid sequences being found for several proteins of more typical size, with 
the order of 150 amino acids in the polypeptide chain. In no case, however, 
can it be said that the sequence itself has revealed any correlation with the 
properties and functions of the protein that was not evident from its overall 
composition. From their appearance, the sequences might as well have been
typed by monkeys on an amino acid typewriter, as arrived at through the millennia
of natural selection.

While Sanger was still publishing the details of the insulin sequence, sug- 
gestions about the secondary structure, or regular coiling, of the polypeptide 
chain were published entirely on the basis of theoretical chemistry! They are 
now thoroughly confirmed. Pauling and Corey noted that every peptide bond
contained both a C=O group and an N-H group: raw materials for making an
N-H...O hydrogen bond, at a considerable saving in energy. Hypothesizing
that such bonds were used to stabilize the polypeptide structure, they began to
explore systematic ways of making bonds in a polypeptide chain, compatible
with the proper lengths and angles for the other bonds, and keeping the atoms
the proper van der Waals’ distance apart. They also noted that the resonating
amide group (the C=O and N-H of each peptide bond, together with the C-N link
between them) must be co-planar. They found that if each C=O group were
hydrogen bonded to the fourth NH along the chain from it, a helical structure
fulfilling all these conditions was readily obtained (fig. 4). The helix, known as
the α-helix, is non-integral; in other words, no small number of peptide bonds
gives an integral number of turns to the helix. Unlike the DNA helix, the
varying, or functional, parts of the polymer stick outwards, and so have the
maximum effect on physical properties.

Evidence of many kinds tended to confirm that this type of coiling took place,
though it was hard to be sure that other, similar methods of coiling were ruled
out. In any case, there was more to be found. Although some proteins, such
as myosin (a protein concerned in the contractile mechanism of muscle) may well
be straight α-helices, many others are not. An α-helix is about 10 Å across;
many protein molecules are 50 Å or more in width.

The molecular structure of two proteins has been determined in some detail
in the last few years by x-ray diffraction studies on their crystals. These
structures contain of the order of 1000 atoms of carbon, nitrogen and oxygen in the
repeating unit of the crystal, about ten times as many as in any structure previously
determined by x-ray analysis. (Hydrogen atoms are not counted since they
diffract too weakly to be observed.) Before a structure can be computed from
x-ray diffraction data, it is necessary to determine the phases of the diffracted
beams. In this case it was done by preparing crystals in which heavy atom
complexes were attached to different points on the molecule, and the scattering
from the heavy atoms (Hg, Au, Pt, etc.) was sufficient to make slight changes in
the diffraction pattern. From these, first the positions of the heavy atoms
could be worked out, and when these were known the phases, and hence the
structure, could be determined.

Fig. 4. The α-helix, proposed by Pauling and Corey. The hydrogen bonds which
stabilize the helix are shown as dashed lines, and the side chains stick out in the
directions marked R. The scale bar in the figure marks 10 Å. (L. Pauling,
R. B. Corey and H. R. Branson, Proc. Nat. Acad. Sci., 37, 207, 1951.)

The number of orders of diffraction observable naturally increases as the
unit cell of the crystal becomes larger. Although x-ray diffraction studies of
proteins have so far been limited to a resolution of 2 Å (that is to say diffracted
beams were only measured if their Bragg spacing was 2 Å or more), this has
meant the measurement of 10 000 different reflexions for each heavy atom
compound of a protein of molecular weight 17 000. These were measured by
conventional techniques: an automatic diffractometer designed at the Royal
Institution has just completed measurements of a further 13 000 reflexions out
to 1.5 Å, where the protein diffraction fades into the background. The handling
of all this data (which includes making a Fourier transformation of it) would be
unthinkable without a fast electronic computer.
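
The two reflexion counts quoted above are consistent with each other, as a rough check shows. The sketch assumes the standard crystallographic rule that the number of reflexions with Bragg spacing greater than d grows as the volume of a sphere of radius 1/d in reciprocal space, that is as 1/d³; the rule and the function name `reflexions_at` are not taken from the article.

```python
def reflexions_at(d_new, d_ref, n_ref):
    """Scale a reflexion count from resolution d_ref to resolution d_new.

    Assumption: the count is proportional to 1/d**3 (volume of the
    reciprocal-space sphere of radius 1/d).
    """
    return n_ref * (d_ref / d_new) ** 3


n_at_2 = 10_000                              # reflexions out to 2 Å (from the text)
n_at_1_5 = reflexions_at(1.5, 2.0, n_at_2)   # expected total out to 1.5 Å
extra = n_at_1_5 - n_at_2                    # additional reflexions between 2 Å and 1.5 Å

print(round(n_at_1_5), round(extra))
```

On this scaling the shell between 2 Å and 1.5 Å should hold somewhat under 14 000 further reflexions, in line with the 13 000 measured on the diffractometer.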


The two proteins which have been studied are closely related. One of them
is haemoglobin, the oxygen transport pigment of the blood; the other is myoglobin,
which fulfils a similar function in muscle. Haemoglobin has a molecular weight
of 68 000 and can carry four oxygen molecules, each attached to a disc-shaped 
porphyrin group, while myoglobin has a quarter the molecular weight and can 
carry but one oxygen molecule on its single porphyrin. The proteins were 
studied by two cooperating groups in Cambridge, the haemoglobin work being 
led by Perutz, and the myoglobin work by Kendrew. 


The initial difficulties held the myoglobin workers back longer, but, once they 
were solved, all the advantages have been with the smaller protein. The first 
important result was the completion of a study at 6 Å resolution. This limit
was carefully chosen to show up any α-helices as clearly as possible. Apart
from these and the oxygen-carrying porphyrin group, it could not show anything
but the outline of the molecules. The method depended on the accurate
measurement of rather small intensity changes; moreover the method of phase
determination had not been used before. Some crystallographers were sceptical
about the possibility of success, but the result was entirely encouraging. It also
showed that the structure was not one which might have been predicted by
brilliant generalization, as with DNA and the α-helix. It was something
completely individual and totally non-symmetric. Although the original 6 Å
map of myoglobin was not completely clear, it showed an elongated region of
high density weaving a tortuous, convoluted course around the flattish disc of 
the porphyrin. The model illustrated shows the shape of these high density 
regions (fig. 5). 

The same stage was not reached with haemoglobin until 1959, two years 
later. The resulting map was clearer, and the model illustrated was built from 
it merely by choosing a certain level of electron density as a cut-off level. Up 
to that time there was very little evidence that haemoglobin might contain 
myoglobin-like units, and indeed the amino acid composition of the two proteins 
is quite different. It was immediately obvious that haemoglobin was just four 
myoglobin units, packed together in a way neater than one could have imagined.
It will be noted from the figure (fig. 6) that a twofold symmetry axis passes
through the middle of the molecule, relating the two chains in pairs. For
clarity, the two pairs of chains, which are slightly different, have been painted 
different colours. Each chain follows just the same weird, convoluted course 
around its porphyrin as in myoglobin. 

Meanwhile, the myoglobin work had been advanced to 2 Å resolution, with
an unexpected degree of success. 2 Å is greater than the spacing between atoms
in organic structures, and it was anticipated that only the spiralling path of the
polypeptide chain would be discerned. However, once the enormous labour
of building three-dimensional models to study the results in detail had been done,
it was found that many, indeed most, of the amino acid side-chains could be
more or less certainly identified by their shapes. The staggering of these
around the helix axis was in perfect agreement with the α-helix structure. A
detailed atomic model (fig. 7) could be built from the results, which, although
tentative in parts, is well confirmed by chemical studies of the protein, which
are being intensively pursued.

Fig. 5. Model constructed from the 6 Å resolution results of the x-ray analysis of myo-
globin. The white ‘sausage’ represents a continuous length of high electron
density caused by the coiled polypeptide chain, and the dark disc shows the position
of the isolated region of electron density due to the oxygen-carrying porphyrin group.
The small balls show points where different heavy atoms were attached for the
structure determination. (J. C. Kendrew.)

This model gives us our first insight into the way the linear or one-dimensional
form of encyclopaedic DNA and the messenger ‘tapes’ of RNA is converted
into three-dimensional structure. The corners in the polypeptide chain are
frequently stabilized by hydrogen bonds across from one part of the chain to
another, and often the one amino-acid, proline, which cannot be built into the
α-helix, is implicated. There is not yet, however, enough information to formu-
late generalizations about these corners which would allow the three-dimensional
structure of other proteins to be inferred from their amino-acid sequence. There
is always the possibility that the protein is fashioned into its three-dimensional
form on some kind of special cellular jig designed for the purpose. But the
undoubted fact that the information is stored in one-dimensional form makes
this merely an added complication. The simplest hypothesis is that the protein
coils spontaneously into its most stable form, which is completely determined
by its own chemical structure.

Fig. 6. Results of the x-ray analysis of haemoglobin at 6 Å resolution. Areas above a
certain level in each sheet of the three-dimensional electron density map have been
built up into a solid model. This model was found to consist of four parts, and
in the upper figure the two pairs have been separated. Each pair is related by a
symmetry axis, and they are painted different colours for clarity. The model
below shows how the four units fit together to make one molecule. (M. F. Perutz.)


Fig. 7. Interpretation of the 2 Å resolution results for myoglobin. The bonds connecting
atoms along the polypeptide chains have been emphasized, and a string has been
threaded along the chain. Many sections are straight α-helices. Some of the
side-chains are speculative. The flat porphyrin group is seen edge on, near
the top. (H. C. Watson and J. C. Kendrew.)


The similarity of coiling shown by myoglobin and haemoglobin is the more 
surprising when it is realized that the myoglobin studied by Kendrew is from 
sperm whale, while the haemoglobin studied by Perutz is from the horse. The 
data available about amino acid sequences in myoglobin and haemoglobin 
indicate a comparatively slight correlation between them. Thus the shape of
the folded molecule appears to be of more fundamental significance than its
amino acid sequence. This vindication of an experimental approach which is
not only laborious, but at one time seemed to many quite hopeless because of the 
complication of the problem, should stimulate studies on many other proteins. 

In this very rapid survey of a field of study which has grown from almost 
nothing in the past decade, the relevance of physical thinking and physical 
methods to the study of biological systems should have become obvious. There
is, however, a more fundamental aspect to it. From the physical scientist’s
point of view, and with the greatest respect to the biologist’s, a science reaches 
maturity when it passes from the stage of observation and classification to the 
stage of inductive logic about general hypotheses. Nobody would claim that 
biology has just entered this stage; the great forefathers of the era are Darwin 
and Mendel. But it seems that for the first time a large body of evidence is 
being assembled from which general hypotheses about biological systems, 
firmly based on physical models, can be made. 

The underlying assumption is, of course, that living organisms can be treated 
as physical systems, whose behaviour is capable of analysis in terms of the usual 
laws governing atoms and molecules. The experiments which have been 
described support this assumption, and we can feel that the analysis has, at 
least, begun. Although this perhaps does not answer any philosophical questions 
about the nature of life, it should at least cause some of them to be phrased 
more carefully. 


The Author: 


Dr. D. M. Blow was educated at Kingswood School, Bath, and Corpus Christi College, 
Cambridge, where he read physics. Most of his scientific career has been at the Medical 
Research Council’s Unit in the Cavendish Laboratory, where he works as a crystallographer 
with a broadening interest in biology. 



Time and Relativity: Part II


by O. R. FRISCH 


Jacksonian Professor of Natural Philosophy, Cambridge University 


In a previous article we have seen that clocks behave in a somewhat surprising 
manner, according to the special theory of relativity: two events, simultaneous 
by the clocks in one inertial frame, will in general not be simultaneous by the 
clocks in another inertial frame, and of two clocks in relative motion, each will 
appear slow to an observer moving with the other. But there was no reason 
to think that two identical clocks at relative rest would not go at the same rate. 
We shall now show that this is no longer so for clocks in an accelerated system, 
or in the presence of gravity. 

Let us consider a spaceship of length b with its rockets going full blast,
giving it an acceleration a; Francis (in front) and Robert (at the rear) have
identical clocks. Light takes the finite time $b/c$ to go from Francis to Robert,
and during that time the ship gains the speed $ab/c$; we shall assume that speed
to be small compared with c. By the time Robert receives a signal (say a train
of monochromatic light or radio waves) he has the speed $v = ab/c$ relative to the
inertial system in which Francis was at rest when that signal was sent out.
Hence he will observe that signal with a Doppler shift, that is with a frequency
increased by the factor $\sqrt{(c+v)/(c-v)} \approx 1 + v/c = 1 + ab/c^2$. This factor is
independent of the frequency of the light and applies equally well to the ticks
of Francis’ clock; hence that clock will appear fast to Robert, by the factor
$1 + ab/c^2$. In the same way we can see that Robert’s clock will appear slow to
Francis, by the factor $1 - ab/c^2$. All this is a straightforward consequence of
the Doppler effect and does not even depend on the principle of relativity.
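
How good the first-order approximation is can be checked numerically. The sketch below works with the dimensionless ratio β = v/c = ab/c²; the sample values of β and the function names are illustrative assumptions, not figures from the article.

```python
import math


def exact_doppler_factor(beta):
    """Doppler factor sqrt((c + v)/(c - v)) written in terms of beta = v/c."""
    return math.sqrt((1.0 + beta) / (1.0 - beta))


def first_order_factor(beta):
    """The approximation 1 + v/c, equal to 1 + ab/c^2 when v = ab/c."""
    return 1.0 + beta


# Illustrative values of beta; a real ship would give something far smaller.
for beta in (1e-2, 1e-3, 1e-4):
    error = exact_doppler_factor(beta) - first_order_factor(beta)
    # The leading neglected term is beta**2 / 2.
    print(beta, error, beta**2 / 2)
```

The neglected part falls off as the square of β, so for any realistic ship, where ab/c² is many orders of magnitude smaller than these sample values, the factor 1 + ab/c² is as good as exact.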

Inside the ship any object released will no longer be accelerated and will 
consequently drop behind; it will be seen to move with the apparent acceleration a 
toward the tail of the ship. Einstein pointed out that this behaviour of objects 
in the ship might equally well be due to the gravitational attraction of a planet 
on which the ship was standing on its tail, provided the empirically found 
proportionality between weight and mass is strictly correct. He took an im- 
portant step beyond this when he proposed (see Einstein 1911) his Equivalence 
Principle, by which a uniform gravitational field is equivalent to an accelerated 
frame of reference in every aspect, including the behaviour of electromagnetic 
fields such as light signals. From this he later developed his ‘General Theory of
Relativity’; but here we shall be concerned only with the direct consequences
of the Equivalence Principle. In particular we can see that two clocks, placed
at the top and the foot of a tower, will behave just like our two clocks in the nose
and the tail of an accelerating spaceship; we have only to replace b, the length of
the ship, by h, the height of the tower, and a, the acceleration of the ship, by g,
the acceleration due to terrestrial gravity. The Equivalence Principle then
predicts that the observer on the top of the tower would see the other clock go
slow compared to his own identical clock by the factor $1 - \Delta\phi/c^2$, where $\Delta\phi = hg$
is the difference in gravitational potential; for instance, excited atoms near the
foot of the tower would emit spectral lines which have lower frequency (are
shifted toward the red) compared to the lines emitted by the same kind of atoms
at the top.

It may be well to make this result plausible in a different way. Let us
consider two equal atoms at different heights, connected over a pulley by that
useful piece of hardware, a weightless string (fig. 1). We assume that to begin
with the lower atom is in its first excited state, possessing the extra energy E and
hence, according to the special relativity theory, the extra mass $E/c^2$. We
then allow it to emit that extra energy as a photon which travels upwards until
it is absorbed by the upper atom. This will now be the heavier and will sink
down, turning the pulley and doing work amounting to $hg \cdot E/c^2 = E\,\Delta\phi/c^2$.
Now we are apparently again where we started and can repeat the process
indefinitely, obtaining work all the time. Have we indeed invented perpetual
motion?


Fig. 1. Two identical atoms, hung over a pulley by a weightless thread, apparently make
perpetual motion possible; the puzzle is resolved by the gravitational redshift of
photons (see text).


Surely not; the work done by the pulley must come from somewhere.
Everything fits if we assume that the photon on arriving at the upper atom has
not quite enough energy to excite the upper atom; indeed its energy must fall
short just by $E\,\Delta\phi/c^2$, the work we expect from the pulley. So the quantum
must arrive with the energy $E(1 - \Delta\phi/c^2)$, and if we remember the proportionality
between frequency and energy of a light quantum we see that this tallies with
the conclusion we got from the equivalence principle.
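
The bookkeeping of the pulley argument can be set out in a few lines. This is only a restatement of the steps in the text; the symbols $E_{\text{arr}}$, $W$, $\nu$ and $\nu'$ (the energy on arrival, the work from the pulley, and the frequencies at the bottom and the top) are labels introduced here for convenience.

```latex
\begin{align*}
  W &= hg \cdot \frac{E}{c^2} = E\,\frac{\Delta\phi}{c^2}
      && \text{(work done as the heavier atom sinks)} \\
  E_{\text{arr}} + W &= E
      && \text{(no energy may be gained in one complete cycle)} \\
  E_{\text{arr}} &= E\left(1 - \frac{\Delta\phi}{c^2}\right) \\
  \frac{\nu'}{\nu} &= \frac{E_{\text{arr}}}{E} = 1 - \frac{\Delta\phi}{c^2}
      && \text{(energy proportional to frequency)}
\end{align*}
```

The last line is the same factor as that obtained for the two clocks on the tower.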

Actually it is not necessary to bring in the quantum theory; merely from the 
Lorentz transformation of energy and momentum it can be shown that the energy 
contained in a light signal changes by the same factor as its frequency if it is 
observed in a different frame of reference. So the proportionality on which our 
agreement rests need not be taken from the quantum theory; it is implicit in the 
special theory of relativity. The fact that the atoms suspended from our weight- 
less string have quantized energy states is not really important; we could have 
used searchlights instead and would still have to come to the conclusion that a
light signal on climbing against gravity loses energy and hence frequency.


But, you will ask, how can the signal lose frequency? Since the distance 
between source and receiver is constant, surely the number of wave crests that 
arrive at the receiver must equal the number that left the source during the same 
time; so the frequency must be the same! That is indeed so provided we use 
clocks that have been synchronized by signals, say from a common master clock, 
rather than clocks of identical construction, as we have assumed so far. In 
the presence of gravity we must distinguish between two ways of establishing 
time units in places of different gravitational potential. We can (1) set up a 
system of clocks that are synchronized by wires or radio signals. In that case 
the frequency of a signal remains constant (by definition!) on travelling from one 
clock to the other, and any number of clocks at relative rest can be synchronized 
unambiguously in that way, whatever gravity fields may be present. The only
disadvantage of that arrangement is that the frequency of a primary standard
such as a caesium clock will no longer be ‘standard’: it will depend on the
gravitational potential at the point where the clock is placed. Alternatively we can (2)
rely on independent clocks of identical construction, for instance caesium clocks; 
then if we define frequency at each place with reference to the local clock we 
must accept that a signal decreases or increases in frequency if it travels in a 
direction of increasing or decreasing gravitational potential. 

In practice the differences are so small they don’t matter; even on Mount
Everest the increase in clock rate against sea level, $hg/c^2$, would be only one part
in $10^{12}$. However, caesium clocks and other atomic clocks are now approaching
that accuracy, and an experimental test may soon be possible. A much greater
difference in potential, $\Delta\phi/c^2 = 2.12 \times 10^{-6}$, exists between the Earth and the
surface of the Sun, and even higher potentials must exist on the surface of the
very dense white dwarf stars. Einstein predicted as early as 1909 that spectral
lines from those sources should be shifted to the red, compared with the lines of
the same atoms on Earth. This gravitational red-shift has been looked for most
carefully but is obscured by a variety of other line shifts, and although there is
some evidence for it the results don’t seem to be conclusive.
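
Both figures are easily reproduced. In the sketch below the height of Everest, the value of g, the solar radius and the solar gravitational parameter GM are assumed reference values, not taken from the article. The solar estimate keeps only the potential at the Sun's surface, GM/R, and neglects the much smaller potentials at the Earth.

```python
C = 299_792_458.0          # speed of light, m/s
G_SURFACE = 9.81           # terrestrial gravity, m/s^2 (assumed round value)
EVEREST_HEIGHT = 8848.0    # height above sea level, m (assumed)
GM_SUN = 1.32712440018e20  # solar gravitational parameter, m^3/s^2 (assumed)
R_SUN = 6.957e8            # solar radius, m (assumed)


def uniform_field_shift(h, g=G_SURFACE):
    """Fractional rate difference hg/c^2 over a height h in a uniform field."""
    return h * g / C**2


def surface_potential_shift(gm, radius):
    """Fractional shift GM/(R c^2) for light leaving the surface of a body."""
    return gm / (radius * C**2)


everest = uniform_field_shift(EVEREST_HEIGHT)
sun = surface_potential_shift(GM_SUN, R_SUN)
print(f"Everest: {everest:.2e}   Sun: {sun:.3e}")
```

The solar shift is about two million times the terrestrial one, which is why the Sun and the white dwarfs were the first places to look.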

However, a recent discovery by Mössbauer (1959) has made it possible to
use certain radioactive nuclei as extremely accurate ‘clocks’. It had long been
realized that the frequency of the gamma rays emitted from nuclei is extremely
well defined; for instance with a frequency $\nu$ around $10^{20}$ c/sec and a mean life
time $\tau$ of $10^{-7}$ sec the natural line width $\Delta\nu/\nu = 1/2\pi\nu\tau$ would be less than $10^{-13}$.
But the gamma rays actually observed have their frequency reduced by the recoil
of the nucleus and the line broadened by its thermal motion (fig. 2). Mössbauer
discovered that when the gamma ray energy is not too high, say below 100 kev,
some of the quanta will be emitted with the full excitation energy, the recoil
having been taken up by the whole crystal lattice rather than by the emitting
nucleus. The lattice is so heavy that its recoil energy is quite negligible, and
it can be shown that the thermal broadening also disappears in that case. In
absorption, too, the momentum of the quantum can be taken up by the lattice.
Both the emission and the absorption spectrum then show a very sharp line
superimposed on their normal spectrum (fig. 2), and this gives rise to a strong
resonant absorption. Mössbauer showed that this resonance can be destroyed
(i.e. the absorber becomes more transparent) if source and absorber are made to
move one relative to the other so that the emission line is Doppler-shifted as
seen by the absorber; the extreme sharpness of the resonance was strikingly
demonstrated by the fact that a relative speed of only 1 cm/sec was ample to
destroy the absorption!

A search through the known nuclides quickly showed that a particularly
favourable nucleus should be ⁵⁷Fe, whose first excited state (conveniently
obtained by the beta decay of ⁵⁷Co, with a half-life of half a year) lies only
14.4 kev above the ground state and has a long mean life $\tau = 10^{-7}$ sec. The
corresponding gamma ray has a frequency $\nu = 3.5 \times 10^{18}$ c/sec and a relative line width
$1/2\pi\nu\tau = 5 \times 10^{-13}$, and from this it follows that the resonant absorption should
drop to one half if the source is moved relative to the absorber at a speed of
about 0.01 cm/sec. Experiments (fig. 3) showed the line to be slightly wider,
depending on the material used and its heat treatment. The reason is probably
that the line is actually a Zeeman multiplet formed by the effect of the atomic
magnetic field on the nucleus, and it is difficult to make the fields exactly alike
in source and absorber.

[Figure 2: line profiles; the only panel label surviving from the original is ‘Emission’.]

Fig. 2. Both in emitting and in absorbing a gamma quantum of energy E, a free nucleus
(mass m) takes up a momentum $E/c$ and hence an energy $E^2/2mc^2$; furthermore the
thermal motion causes Doppler broadening of the line. Hence the emission and
absorption lines are separated by $E^2/mc^2$, and both are broadened. But with bound
atoms the lattice can occasionally take up the momentum, and the energy lost is then
negligible; there appears an unshifted line, both in emission and absorption, causing
strong resonance over a width determined chiefly by the mean life of the excited
state.
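
The numbers quoted for ⁵⁷Fe can be reproduced from the excitation energy and the mean life alone. The sketch uses standard values of the Planck constant, the electron volt and the speed of light (assumed, not given in the article), and takes the speed needed to spoil the resonance as simply c times the relative line width, which is only an order-of-magnitude estimate.

```python
import math

H_PLANCK = 6.62607015e-34   # Planck constant, J s
EV = 1.602176634e-19        # one electron volt in joules
C_CM = 2.99792458e10        # speed of light in cm/sec

energy_ev = 14.4e3          # excitation energy of the 57Fe level (from the text)
tau = 1.0e-7                # mean life of the excited state, sec (from the text)

nu = energy_ev * EV / H_PLANCK               # gamma-ray frequency, c/sec
rel_width = 1.0 / (2.0 * math.pi * nu * tau)  # relative line width 1/(2 pi nu tau)
speed = rel_width * C_CM                     # Doppler speed matching that width, cm/sec

print(f"nu = {nu:.2e} c/sec, width = {rel_width:.1e}, speed = {speed:.3f} cm/sec")
```

The results agree with the rounded figures in the text: a frequency near 3.5 × 10¹⁸ c/sec, a relative width near 5 × 10⁻¹³ and a speed of the order of 0.01 cm/sec.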




Even so, here was clearly a possibility to detect extremely small frequency 
changes, and the idea of looking for Einstein’s gravitational shift occurred to a 
number of people. All that seemed necessary was to mount the source and the 
absorber, one above the other, as far apart as the gamma ray intensity would 
allow, and then look for a small asymmetry in the Doppler shift curve (fig. 3). 
A race developed between Harvard and Harwell, in which Harwell obtained 
the first indication of a shift (Cranshaw et al. 1960); but doubts were cast on that 
result when it was found by Pound and Rebka (1960) that a shift of the same 
magnitude might have been produced by a temperature difference of less than 
1°C between source and absorber.

[Figure 3: transmission (scale mark at 80%) plotted against relative velocity
from -0.06 to 0.06 cm/sec.]

Fig. 3. Typical transmission, through a thin iron foil, of the 14.4 kev gamma rays of
excited ⁵⁷Fe, as function of the relative velocity of source and foil. The absorption
is almost entirely due to the ‘recoil-less’ resonance, which is so narrow that the
Doppler effect from quite slow relative motion is enough to spoil it.


Such a temperature effect had independently been foreseen by Josephson
(1960) and can be described simply as the transverse (quadratic) Doppler effect
due to the thermal motion of the nuclei. The ordinary, linear Doppler effect
is of course many thousand times larger, but is absent in the Mössbauer line;
one may say that it is cancelled out because the emitting nucleus, carrying out
its thermal oscillations, changes its direction many million times during the mean
life of the excited state. But there remains the second-order Doppler effect,
or in other words the relativistic dilatation, discussed in the previous article.
This amounts to $\Delta\nu/\nu = \frac{1}{2}v^2/c^2 = U/2c^2$ if we assume that $\frac{1}{2}v^2$, the mean kinetic
energy per unit mass of the lattice, is half its total energy U per unit mass, as it
will be if the nucleus is bound to the lattice by elastic forces.

Of course if the absorber is at the same temperature as the source one expects
no observable shift, but if they differ by $\Delta T$ the shift will be $\Delta T \cdot C_p/2c^2$, where
$C_p$ is the specific heat. For iron at room temperature this comes to $2.2 \times 10^{-15}$
for 1°C temperature difference, equivalent to the gravitational shift over a vertical
distance of about 70 ft, almost twice the distance used by Cranshaw et al. (1960).
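
The comparison between the thermal and the gravitational shift can be checked in a few lines. The article gives the resulting shift but not the specific heat behind it; the figure of 400 J per kg per degree used below is an assumed round value for the lattice specific heat of iron at room temperature, chosen for illustration, and the function names are hypothetical.

```python
C = 299_792_458.0      # speed of light, m/s
G_SURFACE = 9.81       # terrestrial gravity, m/s^2 (assumed round value)
FT_PER_M = 1.0 / 0.3048

# Assumed round value for the lattice specific heat of iron, J/(kg K).
SPECIFIC_HEAT_IRON = 400.0


def thermal_shift(delta_t, specific_heat=SPECIFIC_HEAT_IRON):
    """Fractional shift delta_T * C_p / (2 c^2) for a temperature difference."""
    return delta_t * specific_heat / (2.0 * C**2)


def equivalent_height(shift, g=G_SURFACE):
    """Height h for which the gravitational shift hg/c^2 equals the given shift."""
    return shift * C**2 / g


shift_per_degree = thermal_shift(1.0)
height_ft = equivalent_height(shift_per_degree) * FT_PER_M
print(f"{shift_per_degree:.2e} per degree, equivalent to {height_ft:.0f} ft")
```

A temperature difference of a single degree thus mimics the gravitational effect of a tower some 20 m high, which is why temperature control dominates the design of the experiment.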




So it is necessary to keep source and absorber at accurately the same 
temperature. Even then there may be a small shift if source and absorber 
are not exactly the same material: the Debye temperature and hence the zero 
point energy may be different, and that may cause a difference in U even if the 
temperature is the same. By checking all those effects, Pound and Rebka (1960) 
verified the existence of the gravitational shift beyond doubt and later found 
it to be 0.97 ± 0.04 times the predicted value.

If the thermal motion caused the line to be red-shifted that means that not all 
the excitation energy E of the nucleus appears in the radiation emitted; what 
happens to the missing energy? The answer, surprisingly, lies in the minute 
decrease in mass, E/c?, that the nucleus suffers on emitting the quantum. We 
may consider the nucleus as bound by harmonic forces to its place in the lattice; 
those forces remain unchanged, and hence its oscillation frequency $f$, which is
proportional to $m^{-1/2}$, will increase by $\Delta f/f = \tfrac{1}{2}\Delta m/m = \tfrac{1}{2}E/mc^2$. The fact that
the Mössbauer effect is recoilless means that the oscillation quantum number
remains unchanged, so the oscillation energy $Q = Um$ of the atom will increase
in proportion with the frequency and thus by the amount $\Delta Q = Q \cdot \tfrac{1}{2}E/mc^2 = E \cdot U/2c^2$,
just the amount that we saw was missing. It may be objected that
individual atoms do not behave like separate oscillators, but form part of a lattice
of many atoms, and that the effect of the change in mass is therefore many times
smaller; but then we must also remember that many different oscillation modes
of the lattice are affected, and if this is properly allowed for one gets the same 
answer as before. 

It is perhaps instructive to go back to the transverse Doppler effect, discussed 
in the previous article, and ask what happens to the energy there. Consider an 
excited atom, with the excitation energy E, travelling at the speed v relative to 
our ‘rest system’. If it emits its energy as a quantum that, in the rest system,
travels at right angles to the motion of the atom, that quantum will be red-shifted
and will deliver only the energy $E\sqrt{1 - v^2/c^2}$ to an absorber at rest. If we
ignore powers higher than $v^2/c^2$ an energy amount $E \cdot \tfrac{1}{2}v^2/c^2$ is missing. What
has happened to the missing energy? Here again the clue is that the atom has
become lighter by $\Delta m = E/c^2$. In the rest system its momentum will be unchanged
since the photon went off at right angles (ignoring, for the moment, the
small recoil effect); so its kinetic energy $E_k = p^2/2m$ has increased by
$\Delta m \cdot |d(p^2/2m)/dm| = E \cdot \tfrac{1}{2}v^2/c^2$, which is just the missing energy.
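The bookkeeping of the last paragraph can be checked with numbers. The following sketch compares the energy missing from the red-shifted photon with the kinetic energy gained by the lighter atom at fixed momentum. The 14.4 keV quantum and the mass of 57 atomic units are used only as illustrative inputs, the speed is arbitrary, and the function names are hypothetical.

```python
import math

C = 2.998e8                  # speed of light, m/s
E = 14.4e3 * 1.602e-19       # energy of the quantum, J (illustrative: 14.4 keV)
M = 57 * 1.6605e-27          # mass of the atom, kg (illustrative: 57 u)
V = 3.0e5                    # speed of the atom, m/s (arbitrary, v << c)


def photon_deficit(e, v):
    """Energy missing from the photon: E - E*sqrt(1 - v^2/c^2)."""
    return e * (1.0 - math.sqrt(1.0 - (v / C) ** 2))


def kinetic_gain(e, m, v):
    """Rise of p^2/2m when the mass drops by E/c^2 at unchanged momentum p."""
    p = m * v
    dm = e / C**2
    # p^2/(2(m - dm)) - p^2/(2m), written so as to avoid cancellation errors
    return 0.5 * p**2 * dm / (m * (m - dm))


print(photon_deficit(E, V), kinetic_gain(E, M, V))
```

The two numbers agree up to terms of higher order in v/c and in E/mc², which is all the argument in the text claims.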

Actually with a little algebra we can solve the problem exactly for any speed 
of the atom, and allowing for the recoil which causes it to change direction. We 
find that the energy of the emitted photon is changed by the factor $(1 - E/2mc^2)\sqrt{1 - v^2/c^2}$.
The second term in the first bracket disappears when we
neglect the mass $E/c^2$ of the photon compared to that of the ‘clock’, as we usually
do in relativity theory. In the same way we can verify that the law of energy 
conservation holds for the ordinary, linear Doppler effect; but in that case the 
recoil of the clock, even if it is treated as very heavy, remains significant. 
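The exact factor can be checked numerically without doing the algebra. The sketch below solves energy and momentum conservation by bisection for a photon emitted at right angles in the rest system and compares the result with the factor quoted above. It works in units with c = 1 and assumes that m in the formula denotes the mass of the excited atom before emission; that reading, the sample values and the function names are assumptions made for illustration.

```python
import math


def photon_energy_numeric(m_excited, m_ground, beta, iterations=200):
    """Photon energy from energy-momentum conservation (units with c = 1).

    The atom moves along x with speed beta; the photon leaves along y.
    """
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    e_total = gamma * m_excited           # total energy before emission
    p_x = gamma * m_excited * beta        # momentum along the line of flight

    def imbalance(k):
        # Photon: energy k, transverse momentum k; the atom recoils with -k.
        atom_energy = math.sqrt(m_ground**2 + p_x**2 + k**2)
        return k + atom_energy - e_total

    lo, hi = 0.0, e_total                 # imbalance(lo) < 0 < imbalance(hi)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if imbalance(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


def photon_energy_formula(m_excited, m_ground, beta):
    """E * (1 - E/2m) * sqrt(1 - beta^2), with E the excitation energy."""
    e = m_excited - m_ground
    return e * (1.0 - e / (2.0 * m_excited)) * math.sqrt(1.0 - beta**2)


# Deliberately exaggerated values so that the recoil term is visible.
print(photon_energy_numeric(1.0, 0.9, 0.6), photon_energy_formula(1.0, 0.9, 0.6))
```

Exaggerated masses are chosen on purpose: for a real nucleus the recoil term E/2mc² is far too small to show up in such a comparison.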

Let us come back to the temperature effect of the Mössbauer line. The
measurements (Pound and Rebka 1960) have shown it to be in good agreement
with special relativity, and this also shows (Sherwin 1960) that those nuclear
clocks are wonderfully insensitive to shaking; their thermal motion subjects them
to incessant accelerations of about $10^{16}g$ (~$10^{19}$ cm/s$^2$) without affecting their
rate by more than one part in $10^{13}$.


200 O. R. Frisch 


A more direct test of the effect of speed on clock rate was performed (Hay 
et al. 1960) by having the source near the centre and the absorber at the periphery 
of a fast spinning wheel. Speeds up to about 200 m/s were used, giving an
expected time dilatation by about 2 parts in $10^{13}$, nearly one half of the line width
for $^{57}$Fe. The difficulty here was to avoid the linear Doppler effect, which in the
line of motion would have been about a million times larger. It would be
impossible to mount the source so close to the centre of the wheel that the absorber
would have no significant radial velocity component relative to the source. The
problem was solved by attaching the source to the wheel, near its centre; then
as long as the wheel is rigid enough (and this was checked) the distance between 
any parts of the absorber and of the source remains constant despite the rotation, 
and thus there is no first-order Doppler effect. Figure 3 shows that for small 
speeds the increase in transmission goes with the square of the line shift, which 
in turn goes with the square of the speed. The resulting increase of transmission 
with the fourth power of the speed was indeed observed, up to the calculated 
increase by about 4 per cent at 200 m/s. Of course the direction of the line 
shift could not be verified in this experiment, only the amount. 

It is quite legitimate to use special relativity as long as we refer all motions 
to an inertial system, most conveniently to that in which the centre of the wheel 
is at rest. But what if we use a frame of reference that rotates with the wheel? 
In that frame both source and absorber are at rest; but an observer experiences
a centrifugal acceleration $r\omega^2$ which he attributes to a gravitational force with the
potential $\phi = \tfrac{1}{2}r^2\omega^2 = \tfrac{1}{2}v^2$. Consequently if the observer is at the edge of the wheel
he expects photons, emitted near the centre, to be blue-shifted by $\Delta\nu/\nu = \phi/c^2 = \tfrac{1}{2}v^2/c^2$
on arriving at the periphery. But this is just what we deduced from
special relativity, using an inertial frame of reference. The experiment of Hay
et al. has sometimes been said to confirm a conclusion from general relativity; 
but actually special relativity is quite sufficient to predict its outcome, and the 
equivalence principle need not be invoked. 
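The agreement of the two descriptions, and the size of the effect at 200 m/s, can be seen in a few lines. In this sketch the wheel radius is a hypothetical value chosen only so that the rim speed comes to 200 m/s; the function names are likewise illustrative.

```python
C = 2.998e8   # speed of light, m/s


def dilatation_inertial(v):
    """Second-order shift v^2 / 2c^2 computed in the inertial frame."""
    return 0.5 * v**2 / C**2


def shift_rotating_frame(radius, omega):
    """Shift phi / c^2 from the centrifugal potential phi = r^2 w^2 / 2."""
    phi = 0.5 * radius**2 * omega**2
    return phi / C**2


rim_speed = 200.0              # m/s, the top speed quoted in the text
radius = 0.07                  # m, hypothetical wheel radius
omega = rim_speed / radius     # angular velocity giving that rim speed

print(dilatation_inertial(rim_speed))
print(shift_rotating_frame(radius, omega))
```

Both expressions give about 2 parts in 10^13, whatever radius is assumed, since only the rim speed enters.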

This is perhaps a good place to go back to the twin paradox, as discussed in 
the previous article. There we concluded that Jack, who has been at rest in 
the same inertial frame all the time, experienced the passage of a longer time than 
his twin brother Albert who travelled out into space and back again. But are 
we not entitled to use Albert’s ship as our frame of reference? In that frame, 
Albert is stationary throughout while it is Jack who moves out into space and 
back again. So now it looks as if it should be Albert, not Jack, whose clock 
shows the passage of a longer time. Since those two answers are in conflict it 
has sometimes been argued that neither can be true, and that both brothers 
must age the same, whichever way they move. 

But if we choose a frame in which Albert is at rest and Jack goes out and back 
again, then in that frame there must be a gravitational field to account for the 
accelerations experienced by Albert, and for the fact that Jack does not feel any 
acceleration although he goes out and back. We shall show that if we include 
the frequency shifts produced by those gravitational fields we arrive at the same 
conclusion as before, and that there is no conflict between the two treatments. 

To simplify the argument we shall assume again that the speeds are so small
that powers higher than $(v/c)^2$ can be neglected. The initial speed-up and the
final slow-down are unimportant because Jack and Albert are so close together 
that they are both at almost the same gravitational potential (and might indeed
avoid those accelerations by comparing their clocks in flight as they pass each
other without matching speed, either initially or at the end). The important
period is that during which Jack (in Albert’s system!) reverses speed. Let
us say that Albert puts on his rockets for the time $t$ so as to produce an acceleration
$a$; then in his frame of reference there is a corresponding gravitational field that
counteracts the rockets and, during the time $t$, gives the acceleration $a$ to Jack.
The distance between the brothers at that period is $vT$ where $T$ is the time
they have been flying apart at the speed $v$. (In our approximation it does not
matter whether $T$ is measured by Jack or Albert.) Therefore the gravitational
potential difference between them is $avT$, and the gravitational shift causes
Jack’s clock to gain the amount $t \cdot avT/c^2$ during the time $t$ the acceleration
is on. Now $t$ has to be long enough to cause reversal, that is $at = 2v$. Hence
the time gained by Jack’s clock is $2Tv^2/c^2$. But this is just twice the amount
which it should have lost according to the argument from the previous article;
remember that in Albert’s frame, which we have now been using, it is Jack who
has been the traveller. Altogether Jack’s clock has gained $Tv^2/c^2$ and this is
just the amount we found previously by referring everything to Jack’s inertial
system. So the answer is the same whichever of the two frames of reference
we use.
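The two ways of keeping the accounts can be compared numerically. The sketch below evaluates the gain of Jack’s clock once from Jack’s inertial frame (with the exact dilatation factor) and once from Albert’s frame (gravitational gain during reversal less the dilatation loss over the two legs of duration T). The speed, flight time and acceleration are arbitrary illustrative values and the function names are hypothetical.

```python
import math

C = 2.998e8   # speed of light, m/s


def gain_from_jacks_frame(v, big_t):
    """Jack's elapsed time minus Albert's for a trip of two legs of length T."""
    return 2.0 * big_t * (1.0 - math.sqrt(1.0 - (v / C) ** 2))


def gain_from_alberts_frame(v, big_t, a):
    """Same difference to order (v/c)^2, reckoned in Albert's frame."""
    t = 2.0 * v / a                                  # time needed to reverse: a*t = 2v
    gravitational_gain = t * a * v * big_t / C**2    # potential difference a*v*T
    dilatation_loss = 2.0 * big_t * 0.5 * v**2 / C**2
    return gravitational_gain - dilatation_loss


v = 3.0e6          # m/s, about one per cent of c (illustrative)
big_t = 3.156e7    # s, one year on each leg (illustrative)
print(gain_from_jacks_frame(v, big_t))
print(gain_from_alberts_frame(v, big_t, a=10.0))
```

The two results differ only by terms of order (v/c)^4, and the second does not depend on the acceleration chosen, since only the product at = 2v enters.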

Of course we have demonstrated this only for small values of $v/c$ where
powers higher than $(v/c)^2$ can be neglected. Otherwise we would have needed
the mathematical apparatus of general relativity; that demonstration has been
published by Born and Biem (1958).

It remains to say a little about satellites. Circling the Earth once in 90 min,
a low-orbit satellite has a speed of $7.7 \times 10^5$ cm/s and hence the time dilatation
amounts to 3.3 parts in $10^{10}$, an amount easily measurable with present-day
atomic clocks (Singer 1956). On the other hand, the gravitational field between
us and the altitude at which the satellite travels speeds up its clock rate. If we
confine ourselves to circular orbits of radius $r$ while $r_0$ is the radius of the Earth,
both the velocity $v$ and the gravitational potential $\phi$ are constant over the orbit;
the relative speeding-up of the satellite clock compared to an identical clock
stationary at sea level is then $\Delta\nu/\nu = (\Delta\phi - v^2/2)/c^2$.
Now $\Delta\phi = \int_{r_0}^{r} (g r_0^2/r^2)\,dr = g r_0^2 (1/r_0 - 1/r)$.
To get $v$ we write the centripetal acceleration $v^2/r = g r_0^2/r^2$, hence $v^2 = g r_0^2/r$.
So we get $\Delta\nu/\nu = (g r_0/c^2)(1 - 3r_0/2r) = 7 \times 10^{-10} \times (1 - 3r_0/2r)$.
For low altitudes the time dilatation (the second term) prevails; above an altitude
of $r_0/2$ (just about 2000 miles) $\Delta\nu/\nu$ becomes positive because the first term,
the relativistic blue-shift, becomes dominant. These predictions will presumably
be tested within the next few years though there would seem to be little
doubt about the outcome.
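The satellite formula lends itself to a short numerical check. The sketch below evaluates the shift both from its two separate terms and from the closed form, and locates the change of sign. The values of g and of the Earth’s radius are assumed round figures, and the function names are illustrative only.

```python
C = 2.998e8     # speed of light, m/s
G = 9.81        # surface gravity, m/s^2 (assumed)
R0 = 6.371e6    # radius of the Earth, m (assumed)
MILE = 1609.344 # metres per mile


def shift_from_terms(r):
    """(delta_phi - v^2/2) / c^2 for a circular orbit of radius r."""
    delta_phi = G * R0**2 * (1.0 / R0 - 1.0 / r)   # potential gain from sea level
    v_squared = G * R0**2 / r                      # from v^2/r = g r0^2 / r^2
    return (delta_phi - 0.5 * v_squared) / C**2


def shift_closed_form(r):
    """(g r0 / c^2) * (1 - 3 r0 / 2r)."""
    return (G * R0 / C**2) * (1.0 - 1.5 * R0 / r)


print("prefactor g*r0/c^2:", G * R0 / C**2)
print("low orbit (r = r0):", shift_closed_form(R0))
print("sign change at altitude r0/2 =", 0.5 * R0 / MILE, "miles")
```

The prefactor comes to about 7 parts in 10^10, the net shift is negative close to the ground, and it changes sign at an altitude of half the Earth’s radius, a little under 2000 miles.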


REFERENCES 


Born, M., and Biem, W., 1958, Kon. Ned. Akad. Wet. B61, 110.

Cranshaw, T. E., Schiffer, J. P., and Whitehead, A. B., 1960, Phys. Rev. Letters, 4, 163.

Einstein, A., 1911, Ann. d. Physik, 35, 898.

Hay, H. J., Schiffer, J. P., Cranshaw, T. E., and Egelstaff, P. A., 1960, Phys. Rev. Letters, 4, 165.

Josephson, B. D., 1960, Phys. Rev. Letters, 4, 341.

Mössbauer, R., 1959, Z. Naturforsch., 14a, 211.

Pound, R. V., and Rebka, G. A. Jr., 1960, Phys. Rev. Letters, 4, 274, 337.

Sherwin, C. W., 1960, Phys. Rev., 120, 12.

Singer, S. F., 1956, Phys. Rev., 104, 11.


The Diploma in Technology : Courses in Applied Physics 


by N. PENTLAND 
Brighton College of Technology


During the last decade the number of students attending full-time courses 
in Technical Colleges for the external honours degree of London University 
has fallen quite sharply and as a consequence many Colleges have been unable 
to continue to offer these courses even if they so wished. During this period, 
however, technical education has undergone a remarkable change which has 
enabled Colleges of Technology to devise new courses and so make their own 
particular contribution to the education and training of technologists as distinct 
from that of the Universities. 

In the years following the war there was much discussion about the future 
role of Technical Colleges but, regrettably, little progress was made until the 
publication of the White Paper on Technical Education in February 1956.[1]
In this the Government announced its decision to allocate, over the succeeding 
five years, 70 million pounds for the provision of new buildings for Technical 
Colleges and at the same time gave considerable encouragement to the larger 
Colleges to redouble their efforts and turn their attention to the education of 
technologists in a way which was to be different from the traditional University 
degree course. In fact, the Universities and Colleges of Technology were to make 
a joint contribution to the total output of technologists. This was to be achieved 
by the establishment of the National Council for Technological Awards—an in- 
dependent self-governing body responsible for the administration of awards of 
high standing which would be available to students attending courses approved 
by the Council and operated in Colleges of Technology. The Council published 
its first memorandum in May 1956 [2] and in this indicated its intention to award
a Diploma in Technology (Dip.Tech. or Dip.Tech.(Eng.)) to a student who
successfully completed an approved course which was to be “equivalent in
standard to an honours degree course of a British University”. This equivalence
was merely one of standard and certainly not of content; Colleges were expected 
to introduce courses which would have a definite technological bias and which 
would be closely linked with industry and reflect the modern developments 
taking place in the particular technology of the subject concerned. 

Considerable progress has been made since the publication of this first 
memorandum and in the latest report [3] of the N.C.T.A. covering the period
April 1960 to March 1961 there are listed eleven courses in Applied Physics 
now in progress and being attended by a total of 464 students. Of these eleven 
courses ten are 4-year sandwich courses whilst one is a 4-year full-time course 
in which three years are spent in College and one complete year in industry. 
The sandwich courses are generally operated so that the student spends six 
months (26 weeks) in College followed by six months in industry in each year of 
his 4-year course. It is expected that the periods of industrial training will be 
as closely integrated with the academic periods as circumstances allow.


Students who attend courses leading to the Diploma in Technology fall into 
two main categories as regards financial support. There are those who are 
recruited by industry and are sent to a particular College for their periods of 
academic training and who return to their sponsoring firms for all their industrial 
training. These students are financed throughout by their firms and are referred
to as industry-based students. On the other hand some students apply direct
to a College and are enrolled for a course as College-based students. Such
students are in receipt of an award from a Local Education Authority or hold a
State Scholarship or Technical State Scholarship for their periods of academic
training. These students are placed by the College with firms for their industrial
training and are paid by the firms during these periods exactly as industry-based 
students. Students in this latter category may go to the same firm for each 
period of training or they may be placed with as many as four different firms 
during the extent of their course. There are, in addition, a small number of 
students who are recruited by industry and who, for one reason or another 
(often financial), are given leave of absence to attend a College whilst being sup- 
ported by a L.E.A. or other State award but who return to, and are paid by, their 
parent firm for all their industrial training. Of the 83 students attending the 
Diploma in Technology course in Applied Physics in my own College last 
session, 74 were industry-based, 4 were College-based and 5 fell into the hybrid
category described above. 

There are two ways in which a student may qualify for entry to a Dip.Tech.
course. One is for the boy or girl to possess a minimum of five passes in appropriate
subjects in the General Certificate of Education Examination, of which
at least two are at Advanced Level. The other is for the applicant to have gained
an Ordinary National Certificate in the appropriate subject at an approved high
standard. Of the 83 students referred to above only 5 (or 6 per cent) were
admitted through the O.N.C. route; of the remainder, 7 (or 8 per cent) obtained
4 passes at ‘A’ or ‘S’ level, 41 (or 50 per cent) obtained 3 passes at ‘A’ or ‘S’
level, and 30 (or 36 per cent) obtained 2 good passes; several in this latter
group had also obtained good passes in additional O.N.C. exemption subjects.
It may also be of interest to note that 11 girls were included in the total. 

As already stated Colleges are expected to develop these new courses in such 
a way that they will have a technological bias, that they will be closely linked 
with industry and modern developments therein, and yet that they will be deep 
enough to provide a discipline of the kind usually associated with an honours 
degree course. This is no easy task and in any one subject there is obviously 
no unique scheme. It is necessary to experiment and it is inevitable that even 
in the eleven physics courses now in progress there will be differences in structure 
and in content. By way of illustration, however, let us consider in broad outline 
the main features of the course at Brighton, a course which is now in its fifth 
year of operation and with which the author has been associated since its in- 
ception. 

So far as it is possible to differentiate between pure physics and applied 
physics the course aims to provide, in the early years, a firm foundation of pure 
physics and the student is given a thorough understanding of the fundamentals 
of his subject. The amount of time devoted to pure physics decreases in suc- 
cessive years of the course and in its place there is a corresponding increase in the 
amount of applied physics with its emphasis on the industrial applications.


CURRICULUM

Number of hours per week (lectures or tutorials / laboratory)

FIRST COLLEGE PERIOD
  Physics (Magnetism, Electricity, Atomic Physics, Heat,
    Light, Properties of Matter)                                  6 / 6
  Applied Physics (Electronics, Photography)                      1 / 2
  Mathematics                                                     6
  Workshop Practice and Drawing                                   3
  Liberal Studies                                                 2
  Total                                                           26

SECOND COLLEGE PERIOD
  Physics (Electricity, Atomic Physics, Heat, Light)              4 / 3
  Applied Physics: Electron Physics, Electronics,
    High Vacuum Technology                                        3 / 3
  Applied Physics: Applied Spectroscopy                           1 / 3
  Mathematics                                                     6
  Electrical Technology                                           1
  History of Science                                              1
  Liberal Studies                                                 1
  Total                                                           26

THIRD COLLEGE PERIOD
  Physics (Electricity, Atomic and Nuclear Physics, Heat,
    Light, Mechanical Properties of Materials)                    5 / 3
  Applied Physics: Electronics                                    1 / 3
  Applied Physics: Applied Acoustics                              2 / 3
  Mathematics                                                     4
  History and Philosophy of Science                               1
  Liberal Studies                                                 2
  Total                                                           24

FOURTH COLLEGE PERIOD
  Physics (Light, Theoretical Physics)                            2
  Applied Physics: Electronics                                    1 / 2
  Applied Physics: Solid State Physics                            2
  Applied Physics: X-ray and Electron Diffraction                 2
  Applied Physics: Nuclear Techniques and Applications            1
  Applied Physics (further entry, row not recoverable)            3
  Project                                                         5
  Mathematics                                                     2
  Industrial Economics                                            1
  Liberal Studies                                                 2
  Total                                                           23


In some courses it is usual to offer special options in the last year of the course
and thereby deal with a smaller number of branches in greater depth. In the 
course described there are no options and all students follow the same course 
and attend the same lectures. Our view at the moment is that it is preferable 
to adopt this procedure and then allow the student to specialize once he has 
obtained his qualification and begun full-time employment in his chosen branch 
of industry. 

The mathematics syllabus has been carefully selected to include those topics 
which are of particular use to the physics course. The teaching of mathematics 
is more concentrated in the early years of the course so that the student is familiar 
with the various techniques before he needs to use them in solving the problems 
he meets in physics. In fact, the final examination in mathematics is held at 
the end of the third year. In the fourth year the time given to mathematics is 
used to introduce the student to some of the topics he is likely to require in the 
future but which are considered beyond the standard of the Dip.Tech. 

In the first year students follow a course of workshop practice where the main 
idea is not so much that they should acquire proficiency as that they should learn 
the capabilities of the machine tools of a physics workshop. In the second year 
there is a short course in electrical technology in which the aim is to give a general 
account from the practical point of view of transformers, motors, electrical 
supply systems, etc.; the physicist being primarily interested in the use of such 
equipment rather than its design. 

An important feature of the course is the inclusion of ‘liberal studies’ in
each year. It is somewhat unfortunate that such a title should have been chosen
and perhaps ‘general studies’ would have been a more apt choice. As shown in
the scheme outlined, time is given to discuss the history and philosophy of science
and to learn the elements of industrial economics. In the first year every attempt
is made to develop practical skills in writing and speaking, and to develop a
critical judgement in reading, listening and viewing. The remainder of the time
is devoted to the discussion and examination of many and varied topics which
are linked together under the general title of “the emergence of the contemporary
scene”. The aim of this course is to stimulate student interest in many aspects
of the present day world and ample opportunity is given for discussion and student 
contribution. The subject runs throughout the course and deals with the ten- 
sions, the hopes and the fears of the present age and the stages by which it has 
come into being. Particular histories are dealt with including art, architecture, 
literature, music, religion and philosophy. In the final year further practical 
instruction is given in speaking and writing, the conduct of meetings, dictation, 
etc., and the presentation of technical information. 

Every student undertakes a project in his final year and this involves him in 
what is to him, at any rate, an original investigation. We regard this project as 
an invaluable part of the course and an activity which is most helpful in pro- 
viding information of use in deciding the final classification. It provides a 
training in research methods, in the development of experimental techniques, in 
the ability to plan experiments. It provides a training in the finding of informa- 
tion and helps the development of initiative and the ability to work alone. It 
teaches the student to learn the art of perseverance, to become used to frustration 
and disappointment, to become self-critical and honest in assessing his own work. 
To provide this training the actual topic is, to some extent, immaterial but, in
general, this will be associated with his industrial interest and his firm will be
consulted when his project is being formulated. In some cases the project 
originates from the firm but all the associated practical work is undertaken at 
the College. The problem is decided upon at the end of the third College period 
and the student and supervisor confer before the next industrial period begins. 


Fig. 1. Neon flash tube counter. 


No preparation is undertaken, however, until some 5 or 6 weeks prior to the student’s
return for his final College period. He is then given references to original
papers and to books for general reading and he begins his experimental work
immediately he returns to College. He spends the full six months on this, one
day per week plus whatever extra time he feels able to devote to this part of his
course, and presents his report within three months of starting his final industrial
training period. The external examiner visits the College to talk to all final
year students while they are at work on their projects some five or six weeks 
before these are completed and then later, following his examination of the 
project report, he subjects each student to a viva voce examination in the presence 
of his project supervisor. In this way, and in conjunction with the assessment 
of routine laboratory work during the third and fourth years, an estimate is made 
of the student’s practical ability and this constitutes a substantial fraction of the 
final examination mark. 


Fig. 2. Schematic of the flash tube apparatus. Blocks shown: high voltage pulse
generator; coincidence circuit and amplifiers; 1000 V distribution; pairs of Geiger
counters at A, B and C; oscilloscope; neon-filled tubes between high voltage
electrodes; quench unit.


The following brief outline of some of the projects undertaken during the 
last session will give some idea of the type of work included. 

One such project was concerned with the development of neon flash tubes for 
the detection of ionizing radiations. Neon flash tubes are glass tubes filled with 
the gas and placed between electrodes. Traversal of a tube by an ionizing 
particle, followed by rapid application of a strong electric field, causes a visible 
discharge. The apparatus (fig. 1), shown schematically in fig. 2, consisted of a
stack of flash tubes within a Geiger-counter telescope together with a high voltage
pulsing unit. The investigation was concerned with the performance of the
tubes under various conditions. However, the electronic circuitry, practically
all constructed from individual components by the student, occupied more time
than anticipated and left less time than originally intended for investigating
performance. This project will be continued by a final year student during the
next College session.

Another project involved an investigation of cathodic vacuum etching
parameters. Again the apparatus was built entirely by the student (fig. 3).
Cathodic vacuum etching is the production of etch patterns on solid surfaces
by bombardment with positive ions in a glow discharge. Etch patterns were
produced and examined on brass, steel and silicon and the effect of varying the
potential drop across the discharge was investigated. Figure 4 (a) shows a brass
specimen vacuum-etched at 4 kV and illustrates the grain boundaries due to
preferential etching; figure 4 (b) shows the etch pits produced in a silicon single
crystal vacuum-etched at 8 kV. The investigation was only partially completed
and this project will be continued by another student.


Fig. 3. Cathodic vacuum etching apparatus.

A third project consisted of an examination of binary alloys by contact X-ray
microradiography. Aluminium-tin alloys containing 1 per cent and 2 per cent of
tin were cast from a carbon crucible. The resulting alloy was radiographed
and the tin was seen to separate out at aluminium crystal grain boundaries after
annealing. Radiographs taken before annealing showed a dendritic structure
within the aluminium-rich grains.


Fig. 4 (a). Brass, cathodic vacuum etched at 4 kV. Magnification × 120.


Fig. 4 (b). Silicon single crystal, vacuum etched at 8 kV. Magnification × 1000.

Other students were concerned with an investigation of the behaviour of a 
water jet under conditions of impressed vibration, the measurement of the re- 
fractive index of thin films of sodium iodide, and the field displacement effects 
in ferrite-loaded waveguides. In all, eleven projects were undertaken in the 
period just completed.

Any account of a course leading to the Diploma in Technology would be
incomplete without reference to the very important part which industry is playing
in providing the periods of industrial training which are essential to the
success of this new experiment in technical education. In fact the course is a 
joint effort between the College concerned and the participating firms and we 
have been particularly fortunate in the degree of co-operation we have received 
from the firms and government establishments with which we are associated. 

On entry to the College each student is assigned to a particular member of 
the physics staff who acts as his tutor through his course, during both academic 
and industrial periods. Each tutor has some six to eight students in his group. 
Unfortunately, lack of accommodation at present restricts very much the tutorial 
activities during College periods; we hope that in the near future occupation of a 
new and much more spacious building will enable us to extend this particular 
side of the work. The tutor has, however, important duties to perform during
the periods of industrial training. He visits each student at his place of 
work on three occasions during each training period, sees the work he is 
doing and discusses this work with him and with his industrial supervisor and 
in particular its relation to the academic part of the course. He then spends 
some time alone with his student dealing with queries which have arisen in his 
guided reading, his physics or mathematics revision problems or with his work 
in general. In this way the College maintains close contact with each student 
while he is away in industry and also forges close links with industrial personnel 
as a result of these frequent visits. It is worthy of note that this process of 
regular visits to industry has had a refreshing influence on the staff concerned 
and has the effect of keeping them up to date with modern developments as they 
occur. 

In a reciprocal manner students are visited by representatives from industry 
during their academic periods. Towards the end of each such period the
Research Manager, Education Officer, or Technical Training Officer visits the 
College to meet the Head of Department together with the student’s tutor. 
Following an exchange of views on the student’s progress and work the student 
is then called in to the discussion and in particular the job he will do during his 
next period of industrial training is debated and finally agreed. This exchange 
of visits is invaluable in producing a course which, while having two distinct 
aspects, is nevertheless an integrated whole. 

It should be emphasized that an important feature of these new courses so 
far as Colleges are concerned is the freedom which they have to design these 
courses, devise their syllabuses and set their own examinations. As already 
stated, the course and syllabuses are subject to the approval of the National 
Council for Technological Awards and the examinations are subject to the
approval of an external examiner (or examiners) appointed by the College and 
approved by the N.C.T.A. This places a new responsibility on Colleges where 
in the past courses have usually been directed towards the London external degree. 
Great credit is due to the external examiners, all eminent in their particular 
field, for the help and advice they have given to College staffs in this particular 
work. 

The number of students enrolling for courses leading to the Diploma in
Technology in Applied Physics continues to increase steadily: 464 now as compared
with 375 a year ago.[3] In my own College we began very modestly with
8 students five years ago whereas the numbers in the 4 years of the course during
the last session were 38 in the first year and 24, 10 and 11 in the subsequent years.
Initially the College periods of the course began in January each year but at the 
beginning of last session it became necessary to introduce a second stream, 
beginning College periods in September, in order that the 38 first year students 
could be accommodated between the two streams. This arrangement involved 
an overlap of some three months when both streams were in attendance at the 
College and a similar overlap period in industry at a different time of the year. 
As a result of repeated requests from our industrial associates it has been decided 
to adopt, as an experiment, a nearly ‘end-on’ arrangement as from September 
1961 so that the two streams will begin their College periods in September and 
February each year*. This will have the effect of reducing the overlap period 
to only two weeks. There are many advantages in this arrangement but in the 
main it entails a more efficient use of College accommodation and equipment and 
at the same time enables industry to increase its number of training places by 
having one student at College while another is in industry and then allowing 
these two students effectively to change places. 

According to the latest published report[3] (July 1961) of the N.C.T.A., 
forty-eight Diplomas in Technology in Applied Physics have so far been awarded. 
I know from personal experience the calibre of some of the students who gained 
these awards and I am convinced that they will prove themselves very worthy 
of the trust placed in them, and in the courses they followed, by their sponsoring 
firms during the very early days of this new development in technical education. 
There is little doubt that the holders of this new award will rapidly become very 
acceptable and competent members of the staffs of industrial organizations, 
government establishments and research laboratories, and it is hoped that more 
and more headmasters and headmistresses of grammar and public schools will 
bring these courses to the attention of the boys and girls who are likely to profit 
from them. 


REFERENCES 


[1] Technical Education, Cmd. 9703, published H.M.S.O., February 1956. 

[2] Memorandum on the Recognition of Courses in Technical Colleges leading to the 
Diploma in Technology, published National Council for Technological Awards, May 
1956. 

[3] The National Council for Technological Awards, Report for the period April 1960 to 
March 1961. Published 1st July, 1961. 


The Author 
Dr. N. Pentland, M.Sc., Ph.D., F.Inst.P., was educated at Wolsingham Grammar 
School, University College, London, and the London University Institute of Education. 
He has been engaged in technical education for twenty-four years and since 1953 has been 
Head of the Physics Department at Brighton College of Technology. He is a member of 
the London and Home Counties Regional Advisory Committee for Applied Physics and 
Mathematics. 




*Provisional figures for the session 1961-62 are 45, 36 (each year in two “end-on” 
streams), 21, 10. 


ESSAY REVIEWS 


Interferometry. A Symposium held at the National Physical Laboratory on 9th, 10th 
and 11th June 1959, (H.M.S.O. 1959.) [Pp. vii+471.] 30s. 


Though some of our familiar types of interferometer are fundamentally those which 
physicists used at the end of last century, there has recently been a marked revival not only 
in the perfection of existing methods but also in the development of entirely new ones. 
Many of the papers read at this symposium review recent work, published but little known, 
and report on the latest progress. This volume thus conveys to the reader a good picture 
of the present state of the subject of interferometry in most of its aspects. 

The periodicity in space of a monochromatic light wave provides an ideal scale, with an 
extremely fine ruling, for measuring distances, a scale which is available in almost unlimited 
length. Interferometry might be described as a vernier method of reading two such scales 
against one another. Since any mathematical interference condition generally depends 
on geometrical quantities (lengths and angles) and on the refractive index and the wavelength, 
all these quantities can be measured by interferometry. The first session was taken 
up in a discourse by F. Zernike which is not included in this volume, and in introductory 
talks by H. Barrell and L. A. Sayce, sketching briefly the range of interferometric methods 
in use at the National Physical Laboratory. 

Part II deals with “Absolute Length Measurement and Light Sources for Interferometry”. 
The search for better standards of length forms a continuing task for 
metrologists. As methods of comparison become more refined, more accurately defined 
length standards are required. The wavelength of the cadmium red line which had taken 
the place of the Paris metre has recently been superseded by the wavelength of the orange 
line, 6058 Å, of the krypton isotope of mass number 86, as an internationally accepted 
standard of length. The accuracy to which this wavelength can be reproducibly measured, 
i.e. compared with other wavelengths, is being studied in standards laboratories of many 
countries. It depends largely on the accuracy to which conditions in the source can be 
specified. Not only the gas pressure but the current density and the direction of observation 
have a detectable influence on the wavelength. An accuracy of 1 part in 10⁸ cannot yet be 
claimed with certainty but appears to be within easy reach now. The present position is 
described by E. Engelhard, whose design of a krypton source has been generally accepted, 
and by K. M. Baird. Atomic beams, either excited by electron bombardment or used for 
absorbing light, produce spectral lines which are very much narrower than those emitted 
by any discharge tube, and atomic beams are likely to form the sources for the wavelength 
standards of the future. A contribution by K. G. Kessler, R. L. Barger and W. G. 
Schweitzer deals with atomic beams of the isotope 198 of mercury and their suitability for 
establishing length standards using the resonance line 2537 Å. The obvious difficulty that 
the wavelength depends critically on the direction of the light beam is overcome in a simple 
and effective way: light is taken out in opposite directions, and the two beams are joined by 
successive reflections. Three papers, by G. W. Series and J. C. Field, by W. C. Martin, 
and by J. Terrien, report new determinations of the wavelength of the helium line 5016 Å, 
which has been previously used as a reference line for determinations of the Rydberg 
constant. Existing discrepancies have been removed by these new results. New instru- 
ments and methods for interferometric length measurement developed at the Mendeleev 
Institute of Metrology were described by A. I. Kartashev. 

Part III is entitled “Optical Testing and the Measurement of Relative Position” and 
deals mainly with three subjects: interference microscopy, moiré fringes and diffraction 
gratings. Under the first heading, M. Françon describes new developments of his method 
of polarisation interferometry, and E. Ingelstam gives a theoretical treatment of the 
interpretation of micro-interferograms. H. H. Hopkins reports on experimental studies of 
phases in optical images; he determines the phase by letting the image-forming light 
interfere with a coherent, plane wave. This method gives more information than the 
mere measurement of intensities in the image, and allows detailed comparison with the 
theory, with very satisfactory results. 




Moiré fringes in their simplest form are not interference effects but merely shadow 
phenomena observed whenever two periodic structures of slightly different periods are 
placed in contact. Two combs, or two pieces of woven material, are familiar examples, and 
it is a matter of every-day observation that a very slow motion of one with respect to the 
other causes the fringes to move very rapidly. Accurate grating replicas are now easily 
available, and moiré fringes as a tool for motion control are becoming of increasing im- 
portance. As a simple example, the uniform relative motion of two similar gratings in 
contact may be mentioned. It causes periodic changes in the intensity of transmitted 
light; if this is made to act on a photo-cell, the constancy of the period of the current lends 
itself readily to electronic control. In his survey of recent advances in this subject, J. M. 
Burch also includes the control of rotations by means of radial gratings. With fine rulings, 
moiré fringes can no longer be regarded as simple shadow effects but have to be treated as 
diffraction phenomena and can be connected with two-beam interference. The inclusion 
of the subject in the symposium thus appears well justified, at least in some of its aspects. 
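
The magnification of motion by moiré fringes can be illustrated numerically. The following Python sketch is not taken from the symposium volume; it assumes two idealised line gratings in contact, with parallel rulings and slightly different periods, for which the fringe spacing is the beat period p1·p2/|p1 − p2|. The function names and the sample periods are illustrative only.

```python
def moire_period(p1, p2):
    """Beat (moiré) period of two parallel line gratings with periods p1 and p2."""
    if p1 == p2:
        raise ValueError("equal periods give no fringes (infinite moiré period)")
    return p1 * p2 / abs(p1 - p2)


def motion_magnification(p1, p2):
    """Fringe displacement per unit displacement of the grating of period p1.

    Shifting that grating by one period p1 moves the pattern by one full
    moiré period, so the ratio is moire_period / p1.
    """
    return moire_period(p1, p2) / p1


# Illustrative values only: periods of 10.0 and 10.1 micrometres.
period = moire_period(10.0, 10.1)
gain = motion_magnification(10.0, 10.1)
print(f"moiré period = {period:.0f} um, magnification = {gain:.0f}x")
```

With periods differing by one per cent, a very slow motion of one grating is thus turned into a fringe motion about a hundred times as fast, which is what makes the photo-cell signal convenient for control.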

Under the title “Interferometry with Electronic Aids”, G. W. Stroke surveys recent 
advances in the making of diffraction gratings. In the ruling engine at the Massachusetts 
Institute of Technology, the motion of the mirror blank is now controlled by an inter- 
ferometric servo-mechanism. This control is accurate to 1/100 of a fringe in displacement 
and 1/100 of a second of arc in rotation, over traverses of 10 in., which are being increased to 
14 in. The resolving power of gratings now made on this engine can reach and even exceed 
10⁶, and the testing of their properties by direct examination of the slit image meets with 
difficulties. The light has to be more highly monochromatic than that obtainable from a 
gas discharge, and the width of the diffraction pattern is microscopically small. By an 
alternative method, the wave front leaving the grating can be tested in a Twyman-Green 
interferometer and the deviation from a plane wave-front measured. In order to convert 
this information into that of the ‘slit pattern’ a Fourier transformation has to be carried out, 
and this is done with the help of an electronic computer (IBM 704). 

Section IV is entitled “Radio, Microwave and Intensity Interferometry”. A. 
Hewish describes a new type of radio telescope. In order to achieve high angular resolu- 
tion, the receiving aperture of a radio telescope has to be very large. In some instruments, 
this aperture is not entirely ‘filled’, but composed of portions whose combined action 
forms a sufficiently complete image. The present suggestion goes one step further: pairs 
of dipoles, at various distances, are used successively, and the information from all the two- 
beam interferences is combined by computation in a Fourier synthesis. Economy of 
physical structure is thus obtained at the expense of additional computational effort. K. D. 
Froome describes the use of Michelson interferometers with microwaves. In contrast to 
light, the larger wavelength of these waves introduces diffraction effects which have to be 
allowed for in the calculation, but this can be done accurately. Since the frequency of 
microwaves can be measured with great precision, the experiment results in a highly 
accurate determination of the velocity of light. With the same object in view, W. Culshaw, 
J. M. Richardson and D. M. Kerns are developing a Fabry-Perot interferometer for micro- 
waves. The reflecting surfaces are built up from stacks of metal rods acting by means of 
their capacitances. 

An instrument known as an intensity interferometer (for want of a better name) was 
first described by R. Hanbury Brown and R. Q. Twiss in 1954, and the first of these authors 
now reports on the proposed design of such an instrument. It is intended for the measurement 
of star diameters. In contrast to Michelson’s stellar interferometer the two beams 
are not brought together directly for interference, but each is made to act on a photocell. 
The photo-currents are proportional to the intensities, not the amplitudes, of each beam. 
The correlation between the intensity fluctuations of the two cells is measured by means of 
amplifiers and allows the degree of coherence of the light to be determined. The instrument 
is restricted to fairly bright stars but can be used for greater lengths of base than 
Michelson’s interferometer. 

The last section, on “Spectroscopic and Photoelectric Interferometry”, deals with a 
subject that was covered by an earlier symposium at Bellevue in 1957 and reports some 
developments of detail rather than fundamentally new progress. S. Tolansky and D. J. 
Bradley report on the construction of a scanning Fabry-Perot interferometer with an 
oscillating plate, as described by Chabbal and Soulet, and A. H. Cook deals with the use of 
interferometers with pressure scanning in metrology. P. Connes reports on the first 
practical experiences with a method of spectroscopic interferometry which he recently 
invented. Two similar plane gratings are used as reflectors in a Michelson interferometer; 
by means of rotation of the compensating plate and simultaneous, slow rotation of the 
gratings, a modulated output signal is produced which is almost identical with a recording 
of the spectrum of one of the gratings, as obtained by conventional methods. Compared 
with the latter, however, the solid angle of the beam is increased, with gain of intensity 
which makes the method especially useful for the infra-red. 

The last two papers deal with a method of interferometric spectroscopy which was 
invented by Michelson and was, for some decades, merely regarded as a kind of curiosity of 
some historical interest. Developments in detection methods and requirements of astronomers 
for work in infra-red light have recently brought this method of ‘visibility curves’ to 
the foreground again. If one of the reflectors of a Michelson interferometer is displaced 
at a constant speed, the light intensity as a function of time will be a pure (cos²) function for 
monochromatic light and can be recorded photo-electrically. For light of arbitrary 
composition, it will be a Fourier transform of the spectrum. The latter can be computed 
from the output function by means of the inverse Fourier transformation, though the 
accuracy of the result will be limited by the finite length of the traverse of the mirror which 
restricts the integration to a finite range. The chief advantage is the great light-gathering 
power. This is partly due to the great angle of admission common to all interferometers 
of axial symmetry, partly to the fact that all wavelengths act simultaneously on the photocell 
all the time. This makes the method attractive for infra-red spectroscopy, the application 
to which is described by H. A. Gebbie. J. Terrien discusses its application to the study of 
line profiles; for symmetrical lines it provides a powerful and comparatively simple method. 
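
The relation between the recorded output and the spectrum can be made concrete with a small numerical experiment. The Python sketch below is an illustration only, not the procedure used by Gebbie or Terrien: it assumes an ideal two-beam interferometer, two monochromatic lines whose wave-numbers and strengths are invented for the purpose, and a plain discrete cosine sum standing in for the inverse Fourier transformation. All names in it are hypothetical.

```python
import math


def interferogram(lines, path_diffs):
    """Detector signal against optical path difference x (cm).

    `lines` is a list of (wavenumber in cm^-1, strength) pairs; each line
    contributes strength * (1 + cos(2*pi*k*x)), i.e. a cos^2 fringe pattern.
    """
    return [sum(s * (1.0 + math.cos(2.0 * math.pi * k * x)) for k, s in lines)
            for x in path_diffs]


def recover_spectrum(signal, path_diffs, wavenumbers):
    """Cosine transform of the interferogram over the finite mirror traverse."""
    dx = path_diffs[1] - path_diffs[0]
    mean = sum(signal) / len(signal)          # remove the constant background
    return [2.0 * dx * sum((v - mean) * math.cos(2.0 * math.pi * k * x)
                           for v, x in zip(signal, path_diffs))
            for k in wavenumbers]


# Hypothetical infra-red doublet: 1000 and 1010 cm^-1, strengths 1.0 and 0.5.
lines = [(1000.0, 1.0), (1010.0, 0.5)]
step = 0.0002                                  # cm, finer than 1/(2*k_max)
traverse = [j * step for j in range(5000)]     # total path difference 1 cm
grid = [990.0 + 0.5 * i for i in range(61)]    # 990 ... 1020 cm^-1

spectrum = recover_spectrum(interferogram(lines, traverse), traverse, grid)
peak = grid[spectrum.index(max(spectrum))]
print("strongest line recovered at", peak, "cm^-1")
```

Because the sum runs over a path difference of only 1 cm, features closer together than roughly 1 cm⁻¹ would not be separated; this is the limitation due to the finite traverse mentioned above.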


H. G. Kuhn. 


The Fermi Surface. (Proceedings of an International Conference held at Cooperstown, 
N.Y.) Edited by W. A. Harrison and M. B. Webb. (New York and London: John 
Wiley & Sons Inc., 1960.) [Pp. xvi+356.] 80s. 


The first theoretical treatments of the behaviour of electrons in metals came soon after 
the discovery of the electron. The picture of the electrons as particles of a classical gas 
moving freely through the metal had its successes, but raised some difficulties (especially in 
discussions of the specific heat), which were resolved only when quantum mechanics came 
along. The most important single consequence of the revised free-electron theory was that 
only a very small proportion of the conduction electrons governed the response of the metal 
to thermal, electrical, or magnetic disturbances—those in a narrow range of energies at 
the top of the occupied region of the system of energy levels. In this theory however, 
since it ignored the periodic potential of the crystal lattice, the only variable characterising 
a particular metal was the number of conduction electrons per unit volume, a quantity 
varying through the metallic elements of the periodic table very much less than the observed 
electrical, magnetic and thermal properties. Most of the observed properties, with the 
notable exceptions of ferromagnetism and superconductivity, could be discussed in general 
terms; but qualitative agreement between experiment and theory seemed almost to be re- 
stricted to the alkali metals. 

When the crystal potential was introduced into the theoretical models its effects could 
be seen most clearly by representing the electronic structure by a three-dimensional plot 
of energy as a function of wave-vector. In this representation the constant-energy surface 
enclosing all occupied states at absolute zero has come to be known as the Fermi Surface. 
In the free-electron model this surface was always a sphere, and the wave-vector was 
simply related to the velocity of an electron: in the presence of the crystal potential constant- 
energy surfaces are not, in general, spherical, the Fermi Surface may be intersected by 
planes across which the energy changes discontinuously, and the velocity of an electron 
on this surface and its response to external fields may be very different from those of a 
free electron. Any attempt, therefore, to explain or predict properties of a particular metal 
must take into account the shape and properties of its Fermi Surface. This has the 
consequence that we can speak with comparative confidence of the properties and electronic 
structure of a useless metal like sodium, while there are few more mysterious metals than 
iron! For the technologically important metals are those in which large numbers of electrons 
per atom are influenced by the other atoms in the crystal. 

It should not, however, be thought that only utilitarian considerations brought together 
the group of ninety solid state physicists whose deliberations at an International Conference 
held at Cooperstown N.Y. a year ago are published in full in the book under review. (One 
of the many satisfactory features of this record of the conference is the speed with which it has 
been produced.) This field provides some fascinating theoretical problems, and has been 
made more exciting in recent years by the development of a large number of elegant 
experimental techniques which can give surprisingly direct information about the 
electronic structures of metals. 

It would be pointless to attempt here to indicate in any detail what is to be found in 
this book, and I shall try instead to give some account of the things people are doing, 
illustrating this by reference to particular papers that were presented. The book itself has 
extremely useful summary articles, a theoretical one by M. H. Cohen, and an experimental 
one by A. B. Pippard, in which the problems, growing points, and possibilities of the field, 
as revealed at the conference, are reviewed. 

In the general theory of electrons in metals that is used as a basis for most discussions 
of Fermi Surfaces, interactions between electrons (except those of a very general nature) are 
ignored, but theoreticians are by no means convinced that this is a valid procedure. Efforts 
are therefore being made to see what features of the Fermi Surface will remain when the 
electrons are no longer treated as independent particles, and the opening theoretical session 
presented some of the conclusions reached by Kohn, Luttinger, and others. It also showed 
how interest and confidence in attempts to calculate the energy-band structures of metals 
are reviving as newer mathematical techniques are introduced. (For a number of years 
uncertainties about the choice of a realistic crystal potential for the electrons and the range 
of validity of approximations had caused many people to regard such calculations as little 
more than mathematical exercises.) Another revival has been the nearly-free-electron 
approximation in which conclusions about the forms of Fermi Surfaces are based on the 
introduction, into the wave vector space of free-electrons, of small energy discontinuities 
across Brillouin zone boundaries. W. A. Harrison has been especially concerned with 
showing how this approach, first used by H. Jones in the 1930’s, can be justified and applied 
to polyvalent metals with useful results; and he provides an easily-read account of such 
work. 

The experimental techniques which can provide information about the Fermi Surfaces 
of particular metals and the properties of electrons on such surfaces have grown in number, 
but some of the best established are those which depend on the quantization in a magnetic 
field of the motion of electrons in a plane perpendicular to the field. This leads to a periodic 
variation of the magnetic susceptibility with H⁻¹; and this effect, the de Haas-van Alphen 
effect, has been extensively investigated by Shoenberg, who gives an account of the way in 
which information on extremal cross-sectional areas of Fermi Surfaces has been obtained 
from it. Older work at low fields gave indications of the shape of small pieces of Fermi 
Surface in complicated metals like bismuth, and recent work in high fields on copper, silver 
and gold has yielded information on the way in which their Fermi Surfaces are distorted 
from a spherical shape to touch planes of energy discontinuity. 
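
The link between the period of the oscillations and the cross-sectional area is the Onsager relation, which is not written out in this review. In SI units the oscillations are periodic in 1/B with frequency F = ħA/(2πe), where A is an extremal area of the Fermi Surface in wave-vector space. The Python sketch below evaluates this for a free-electron sphere; the conduction-electron density taken for copper (8.47 × 10²⁸ m⁻³, one electron per atom) is an assumed textbook figure, not a value quoted at the conference, and the function names are illustrative.

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant, J s
E_CHARGE = 1.602176634e-19  # elementary charge, C


def fermi_wavevector(n):
    """Free-electron Fermi wave-vector (m^-1) for electron density n (m^-3)."""
    return (3.0 * math.pi ** 2 * n) ** (1.0 / 3.0)


def dhva_frequency(area):
    """de Haas-van Alphen frequency F (tesla) for extremal k-space area (m^-2).

    Onsager relation: the susceptibility oscillates in 1/B with period 1/F,
    where F = hbar * A / (2 * pi * e).
    """
    return HBAR * area / (2.0 * math.pi * E_CHARGE)


n_copper = 8.47e28                       # assumed free-electron density, m^-3
k_f = fermi_wavevector(n_copper)
area = math.pi * k_f ** 2                # great-circle section of the sphere
freq = dhva_frequency(area)
print(f"k_F = {k_f:.3e} m^-1, F = {freq:.3e} T, period in 1/B = {1.0 / freq:.3e} T^-1")
```

A measured frequency that departs from the free-electron figure, or that changes with the direction of the field, is then a direct sign of distortion of the surface from a sphere.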

It was unfortunate that none of the Russian solid state physicists was able to attend 
the conference, for they have made some very great contributions to this field. In particular 
the relationship between the connectivity of Fermi Surfaces and the high-field behaviour of 
magneto-resistance has been explored theoretically by Lifshitz, Peschanski, and their 
co-workers, and experimentally by Gaidukov and others. Chambers reviewed this work and 
also discussed the less unambiguous information that can be derived from measurements 
at lower fields. (In all such explorations of Fermi Surfaces by methods which cause 
electrons to make orbits in real space while their wave vectors make orbits over the Fermi 
Surface, a field is ‘high’ if the radius of the electron orbit is of the order of the mean free 
path at the temperature of measurement.) Quantization effects, like those giving the de 
Haas-van Alphen oscillations, can be seen in the magneto-resistance of metals like bismuth. 

Last year Contemporary Physics published an article on cyclotron resonance in metals 
by Kip, and at the conference he described the work of this type that has been carried out on 
metals since Azbel’ and Kaner discussed theoretically its observation in such materials. 
In this experiment electrons in orbit in a steady magnetic field can resonantly absorb energy 
from a radio-frequency electromagnetic field as they enter the skin depth of a sample whose 
surface is parallel to the magnetic field. From the values of field and resonant frequency 
one can calculate the average effective mass for the electron orbit, a quantity depending on 
the second derivative of the energy with respect to the wave-vector. This type of experi- 
ment does not, therefore, indicate the shapes of Fermi Surfaces directly but gives valuable 
information about the states lying on it. It has been applied with success to copper, 
aluminium, and bismuth; and experimental data have been obtained for a number of other 
metals. 
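
The arithmetic connecting field, frequency and effective mass is short enough to show directly. The Python sketch below assumes the simplest resonance condition in SI units, ω = n·eB/m*, where n = 1 is the fundamental and larger n correspond to the subharmonic resonances expected in the Azbel’-Kaner geometry. The 24 Gc/s frequency is an illustrative choice, and the function and argument names (including `harmonic`) are hypothetical.

```python
import math

E_CHARGE = 1.602176634e-19        # elementary charge, C
ELECTRON_MASS = 9.1093837015e-31  # free-electron mass, kg


def effective_mass_ratio(b_field, frequency_hz, harmonic=1):
    """Cyclotron effective mass m*/m_e from a resonance at field b_field (tesla).

    Resonance condition assumed: omega = harmonic * e * B / m*.
    """
    omega = 2.0 * math.pi * frequency_hz
    return harmonic * E_CHARGE * b_field / (omega * ELECTRON_MASS)


def resonance_field(mass_ratio, frequency_hz, harmonic=1):
    """Field (tesla) at which an orbit of given m*/m_e resonates."""
    omega = 2.0 * math.pi * frequency_hz
    return mass_ratio * ELECTRON_MASS * omega / (harmonic * E_CHARGE)


b_free = resonance_field(1.0, 24e9)
print(f"free-electron resonance at 24 Gc/s: B = {b_free:.4f} T")
```

An orbit whose resonance appears at a lower field than this free-electron value has an effective mass correspondingly smaller than the free-electron mass.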

Of the various effects that can now be used to give geometrical information about the 
Fermi Surface, the first actually to be used was the anomalous skin effect, but Pippard, its 
originator, would not now regard it as very suitable for tackling more complicated metals. 
The use of this effect relies on the fact that when the mean free path of conduction electrons 
is very long compared with the resistive skin depth for the radio-frequency field used to 
measure the surface impedance, this quantity is related very directly to the shape of the 
Fermi Surface. This topic was reviewed at the conference by G. E. Smith; and new 
results on aluminium were presented by Fawcett and related to the nearly-free-electron 
model. 

A fruitful development of ultrasonic work has been the use of 10-200 Mc/s ultrasonics 
for the study of solids, one aspect of which is the derivation, from the magnetic field dependence 
of ultrasonic attenuation, of dimensions of the Fermi Surface. This again requires 
very pure metal samples with electron mean free paths that are long compared with the 
ultrasonic wavelength, which serves as a sort of calipers with which extremal diameters 
of the surface can be measured. Morse presented some very elegant data on the noble 
metals, which are in good agreement with the results of Shoenberg referred to above; Pippard 
dealt in a persuasive manner with some of the rather difficult theoretical aspects, and experi- 
mental contributions from a number of other workers showed that this is becoming one of 
the more fashionable techniques. 

None of the other types of experimental data discussed gives information as direct as those 
so far mentioned, but it has been extremely interesting to see how older speculations about 
Fermi Surfaces based on ‘ classical’ properties have sometimes been justified and sometimes 
overthrown by the new methods. This type of speculation will continue, for many of the 
most interesting metallic materials are alloys in which the long mean free paths required by 
the new methods will never be achieved. It is therefore to be hoped that optical work like 
that described by Rayne, and ordinary transport measurements like those discussed by 
Ziman will continue to be provided, and theoreticians will go on attempting to unravel 
anisotropies of Fermi Surfaces and anisotropies of relaxation times. 

An account of the conference and its record would not be complete without reference 
to the lively and extensively recorded discussions. These, together with the summary 
articles mentioned above, will be of interest even to the non-specialist as signs of the way 
people are thinking in this field. In general, however, this is a book for the specialist. 
The physicist who wishes to learn about the electronic structures of metals and the ways 
of investigating them experimentally would do best to read first one of the standard older 
textbooks followed by the excellent review given by Pippard in the 1960 Reports on Progress 
in Physics. For the metal physicist however the well-produced and unabridged record of 
this conference will prove extremely valuable for some time to come. 


B. R. Coles. 


The International Conference on Cosmic Rays and the 
Earth Storm, Kyoto, Japan. September 3-15, 1961 


by P. L. MARSDEN 
Physics Department, University of Leeds 


The conference consisted of two international scientific meetings which 
were held consecutively. The first of these, the International Symposium on the 
Earth Storm was directed chiefly to magnetic storms, aurorae and ionospheric 
disturbances. This was followed by the International Conference on Cosmic 
Rays which discussed such features of the cosmic radiation as composition, 
origin, extensive air showers, muons and jets. Also, joint meetings were arranged 
on topics of common interest with papers on radiation belts, solar radiation, 
modulation of the cosmic radiation and the interplanetary plasma. 

The conference at which some 27 countries were represented by nearly 400 
delegates was extremely well organised by the Science Council of Japan. The 
meetings were divided into ordinary sessions, and plenary sessions at which 
contributions to the ordinary sessions were summarised and reviewed. In view of 
the large numbers of papers there is only space here to mention a few of the 
outstanding features of the conference. 


THE EARTH STORM 


The honour of presenting the first paper was accorded to S. Chapman, 
who reviewed the history of our knowledge of earth storms together with present 
views and future prospects in this field of research. 

At the first plenary session E. H. Vestine described the various phases of the 
geomagnetic field changes during magnetic storms. The sudden commencement 
(SC), to within the accuracy of present observations, occurs simultaneously over 
the whole world and is associated with an increase in the northerly component of 
the earth’s magnetic field of some tens of γ (1γ = 10⁻⁵ oersted). After the SC the 
initial increase of field persists for a few hours before the onset of the main phase, 
in which the magnetic intensity decreases below the pre-storm value. Recovery 
from the storm is usually complete after a few days. Satellite observations with an 
important bearing on mechanisms to account for storms were reported by 
E. J. Smith and C. P. Sonnett, who showed that at a distance of about 4 earth 
radii (Re) the main phase and recovery field changes were coincident with the 
storm field measurements at the surface of the earth, but the decrease at 4Re was 
some 2.5 times greater than at 1Re. The authors suggested that a ring current of 
radius about 6Re in the magnetosphere may be responsible for the storm field. 

A ring current to explain the main phase of the storm was invoked in several 
papers on the theory of storms which were reviewed in plenary sessions by 
S. F. Singer and E. N. Parker. In principle, if not in detail, it now seems 
agreed that the most plausible storm model is one in which the SC is caused by 
the arrival at the earth of solar plasma, which according to Singer then propagates 
hydromagnetic shock waves along the lines of force to the auroral zones setting 
up ring currents in the ionosphere which are responsible for the rapid onset and 
the increase of the field during the initial phase of the storm. According to 
Parker the continued pressure of the solar plasma simply compresses the geo- 
magnetic lines of force thereby increasing the surface field. The main phase 
is attributed to trapped protons drifting azimuthally around the earth which 
draw the geomagnetic field lines outwards, causing a decrease in the surface field. 
The position of this trapped radiation is put at between 3 and 5 Re, not quite in 
agreement with the suggestion made by Smith and Sonnett. The trapped protons 
are considered to be present in the magnetosphere before the storm occurs, but 
during the initial phase of the storm they are accelerated hydromagnetically to 
have sufficient energy to account for the observed changes of the magnetic 
field. The decay phase is accounted for by a process of charge exchange in which 
the fast protons capture electrons from neutral hydrogen to become fast neutral 
particles which are no longer constrained to the lines of force and escape. The 
presence of a belt of trapped protons of the radius required by this theory has 
yet to be detected experimentally. 

Detailed information on the electron belt was given by J. A. Simpson, 
C. Y. Fan and P. Meyer who reported on an investigation into the structure 
of the outer belt occurring at distances between 3 and 5Re using shielded pro- 
portional counters on the Explorer VI satellite. It has been known for some time 
that the outer belt consists of energetic electrons with energies up to about 
1000 kev, and that the characteristics of this belt change considerably with 
time and solar activity. The origin of these high energy electrons is not yet 
clearly understood. Simpson reported that two distinct peaks of electron 
density were identified and found to persist during the period of the experiment. 
These peaks were found to undergo considerable radial motion during magnetic 
disturbances but at all times the spatial distribution of the trapped electrons 
followed the dipole lines of force. This is an important observation for it shows 
that at all times, even during disturbed periods, the electron energy density 
within the belt is always less than the energy density of magnetic field in the region. 
The observed energy spectrum of the electrons is found to be consistent with that 
expected from the acceleration of ambient electrons which may have arisen from 
neutron beta decay, but Simpson considers that other injection processes must 
also be operative to account for the spatial distribution of the electrons. 

Several interesting features of the solar plasma were reported in a paper read 
by H. S. Bridge who gave the preliminary results of the MIT plasma probe 
experiment. This probe, carried in the Explorer X satellite, consisted of a 
Faraday cup which measured the positive ion current due to the proton com- 
ponent of the plasma. By means of suitable modulating grids in front of the 
cup it was possible to determine the energy of protons over the range 5 ev to 
2 kev. Observations on the density and energy spectrum of the protons were
obtained between 1 and 35 Re. Up to about 3 Re some difficulty is experienced 
in the interpretation of the results due to out-gassing of the probe, but relatively 
cold plasma was detected. Between 3 and 25 Re no plasma was detected (in 
agreement with Russian observations), but further out there was a measurable 
flux of protons corresponding to about 10 proton/cc, with an energy distribution 
peaked at about 500 ev. Although the protons were coming predominantly 
from the direction of the sun the density was found to vary considerably, in some 
regions of space being less than 0·1 proton/cc. At the time of an SC detected at
the earth a higher energy solar plasma was detected. On board the same satellite
were two magnetometers, and J. P. Heppner reported that at large distances the
magnetic field was patchy, varying between a relatively stable value of about 20 γ
and a fluctuating field of about 4 to 40 γ. B. Rossi pointed out that in the
regions in which there was a large proton density the magnetic field was weak and 
fluctuating but where the field was steady the proton density was low. The 
dimensions of these regions were given as about 104 cm and Rossi showed that
the total energy density in both regions is roughly the same, for one region 
appearing almost entirely as kinetic energy of the protons and in the other as 
magnetic energy associated with the field. One peculiar and unexplained feature 
found in this experiment was that protons detected following a flare on the sun 
had a measured energy corresponding to a velocity of 400 km sec⁻¹ whereas
from the observed transit time of the plasma, their velocity was estimated to be
10⁴ km sec⁻¹.
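
The figures quoted for the solar plasma can be checked with a little arithmetic. The following is a minimal sketch, not part of the reported work: it converts between proton speed and kinetic energy (non-relativistically) and compares a particle energy density with a magnetic energy density B²/8π. The function names and the sample inputs (10 protons/cc at 500 ev, a 20 γ field) are illustrative assumptions drawn from the numbers mentioned in the report; they do not describe any single region actually measured.

```python
import math

# Physical constants (rounded; assumed values for this sketch)
PROTON_MASS_KG = 1.6726e-27
EV_IN_JOULES = 1.6022e-19
EV_IN_ERG = 1.6022e-12


def proton_energy_ev(speed_km_s):
    """Non-relativistic kinetic energy (ev) of a proton moving at speed_km_s."""
    v = speed_km_s * 1.0e3  # km/s -> m/s
    return 0.5 * PROTON_MASS_KG * v ** 2 / EV_IN_JOULES


def proton_speed_km_s(energy_ev):
    """Non-relativistic speed (km/s) of a proton of kinetic energy energy_ev."""
    v = math.sqrt(2.0 * energy_ev * EV_IN_JOULES / PROTON_MASS_KG)
    return v / 1.0e3


def kinetic_energy_density(n_per_cc, energy_ev):
    """Particle energy density in erg/cc for n_per_cc protons of energy_ev each."""
    return n_per_cc * energy_ev * EV_IN_ERG


def magnetic_energy_density(field_gamma):
    """Magnetic energy density B^2/(8 pi) in erg/cc; 1 gamma = 1e-5 gauss."""
    b_gauss = field_gamma * 1.0e-5
    return b_gauss ** 2 / (8.0 * math.pi)


if __name__ == "__main__":
    print("energy at 400 km/s (ev):", proton_energy_ev(400.0))
    print("speed at 500 ev (km/s):", proton_speed_km_s(500.0))
    print("particle energy density (erg/cc):", kinetic_energy_density(10.0, 500.0))
    print("magnetic energy density (erg/cc):", magnetic_energy_density(20.0))
```

On these assumptions a 400 km sec⁻¹ proton carries rather less than 1 kev, and the two energy densities come out within an order of magnitude of one another.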


Cosmic Rays 

During the past decade considerable effort has been devoted to experiments 
designed to determine the position of the source of the cosmic radiation by study- 
ing the spatial distribution of the high energy particles arriving at the earth. 
If the distribution is asymmetrical, then because of the earth’s rotation there 
should appear a periodic variation in the intensity of the detected radiation and 
for the high energy particles which suffer least deviation in the interstellar 
magnetic fields, the direction of maximum intensity would be expected to 
correspond to the direction of the source. In the past, the results for primary 
particles of energy about 10¹⁶ ev have been inconclusive and therefore the
results presented by J. Linsley for particles with energy in excess of 10¹⁷ ev
were of particular interest. The particles were detected through the extensive 
air showers they produce in the atmosphere. The detector consisted of an array 
of nineteen 3·3 m² plastic scintillators spread over an area of 2 km². Timing
of the passage of the shower front across the array was used to determine the
direction of the primary particle initiating the shower. Twenty showers have
been detected which were estimated to be due to primaries of average energy
4 × 10¹⁸ ev and for this group of events the arrival directions were found to be
anisotropic indicating that the source of the enhancement lies in a plane
perpendicular to the local spiral arm of the galaxy. For particles of average energy
8 × 10¹⁷ ev the anisotropy is much less marked but there does appear to be a
pronounced minimum in the intensity around right ascensions of 14 to 16 h.
A minimum in the same direction was also reported by K. Greisen from the 
results of the Cornell extensive air shower experiment. Linsley concludes that 
these effects suggest that the most energetic particles tend to circulate within the 
spiral arm in low pitched spirals. 

If the high energy events reported here were due to protons then, for 
reasonable values of the galactic magnetic field (~ 3 × 10⁻⁶ oersted), the radius of
curvature of the proton trajectories would have been as great as the galactic halo; 
if they were due to iron nuclei the radius of curvature would have been twenty 
times smaller, but Linsley considers that even this is not small enough to contain 
the particles within the galaxy. The most energetic particle detected so far in 
these experiments was estimated to have had an energy of 6 x 10 ev. 
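
The scale of the radius of curvature referred to above can be estimated from the standard expression for an ultra-relativistic particle, r = E/(ZeB). The sketch below is illustrative only: the function names are hypothetical, the field strength of 3 × 10⁻⁶ gauss is an assumed round figure of the order usually quoted for the galaxy, and the energy of 4 × 10¹⁸ ev is the average energy of the showers mentioned earlier.

```python
# Gaussian (cgs) constants, rounded; assumed values for this sketch
ELECTRON_CHARGE_ESU = 4.8032e-10
EV_IN_ERG = 1.6022e-12
CM_PER_PARSEC = 3.0857e18


def gyroradius_cm(energy_ev, charge_number, field_gauss):
    """Radius of curvature r = E / (Z e B) for an ultra-relativistic nucleus.

    Valid only when the kinetic energy greatly exceeds the rest energy, so
    that momentum times c is approximately equal to the total energy.
    """
    energy_erg = energy_ev * EV_IN_ERG
    return energy_erg / (charge_number * ELECTRON_CHARGE_ESU * field_gauss)


def gyroradius_parsec(energy_ev, charge_number, field_gauss):
    """Same radius expressed in parsecs."""
    return gyroradius_cm(energy_ev, charge_number, field_gauss) / CM_PER_PARSEC


if __name__ == "__main__":
    field = 3.0e-6  # gauss; assumed galactic field strength
    print("proton, 4e18 ev (pc):", gyroradius_parsec(4.0e18, 1, field))
    print("iron,   4e18 ev (pc):", gyroradius_parsec(4.0e18, 26, field))
```

Because the radius scales inversely with the nuclear charge, an iron nucleus (Z = 26) of the same energy is bent into a circle some twenty-odd times smaller than a proton, which is the comparison made in the text.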

Closely connected with the problem of a meta-galactic origin for part of the 
cosmic radiation is the question of whether the energy spectrum of the primary
radiation is continuous or shows a break at these very high energies. A
definite break would be interpreted as showing the detection of meta-galactic 
particles. On this matter Linsley reported that there was no conclusive evid- 
ence for a break in the spectrum up to 107 ev. 


On the basis of radio-astronomical observations V. L. Ginzberg has developed 
a theory of the origin of the cosmic radiation in which the cosmic rays are formed 
during the eruption of super-novae. It is assumed that cosmic ray particles, 
including electrons, are accelerated in the expanding shells of super-novae, and 
Ginzberg has deduced the galactic flux of electrons which would arise in this 
way. In addition there is also the possibility of a second source of electrons 
from the interactions of cosmic ray particles with interstellar matter creating 
π-mesons which then give electrons via the π-μ-e decay. If electrons come
principally from the second effect there should be roughly equal numbers of positrons
and electrons in the cosmic ray flux but for beta particles produced at super-novae
there is the possibility that there might be an excess of electrons if for example
these arose chiefly from the beta decay of neutron rich nuclei. Likewise there
should be a measurable flux of γ-rays from π⁰ decays both from the source
region and from interstellar matter. Two papers were presented describing
experiments designed in one case to detect electrons and in the other γ-rays.


P. Meyer has investigated the electron component using scintillators 
flown in rockets. He reported that the flux of electrons with energies between 
100 and 1300 Mev was 35 electrons (cm² sec strd)⁻¹, a value not inconsistent
with the predictions of V. L. Ginzberg, but it is difficult to say whether this is 
due to a galactic flux of electrons or to electrons which have been produced within 
the solar system. It would appear important to repeat this and similar experi- 
ments during the coming solar minimum (1964-65) when the contribution from 
solar electrons should be minimized. As yet there is no experimental evidence 
concerning the electron-positron ratio. 


W. L. Kraushaar and G. W. Clark reported on their attempt to measure the 
omnidirectional γ-ray flux using a detector carried on the Explorer XI satellite.
The results obtained indicated the presence of a γ-ray flux, presumably from
nuclear interactions with interstellar matter, which was not in quantitative
agreement with the predicted value. No data were reported on the flux of γ-rays
from discrete sources. 


Also concerned with the origin and nature of the source of cosmic radiation is 
the accurate determination of the chemical composition of particles arriving at 
the earth. In order to deduce the abundances of the source elements, it is 
necessary to take into account the fragmentation of the particles on their journey 
through interstellar matter. Observations on the abundances of the light group 
of elements Li, Be and B, whose universal abundances are so low that their 
presence in the flux of cosmic ray particles must be attributed almost entirely to 
the fragmentation of heavier elements are particularly important in this
connection. There has been considerable controversy in the past over the measured
values of the abundance of this group of elements but workers in this field are at
last agreed that for particle energies greater than 4·5 Gev the ratio of the flux
of light particles (Li, Be, B) to that of the heavy particles (Z > 10) is 0·21 ± 0·05,
corresponding to an average life time for light particles of about 4 × 10⁸ years.
V. L. Ginzberg has shown that such a value agrees well with the assumption
that only heavy nuclei, i.e. those with Z > 10, are accelerated in the region
of the source. 

Some twenty papers were devoted to the cosmic ray intensity changes 
following solar flares, most of them relating to the unique series of events in 
November 1960, when within the space of three days the same sunspot group 
twice emitted high energy particles. With the increasing availability of rockets 
and satellites it is now clearly established that in addition to the ejection of high 
energy protons the sun also emits heavier nuclei during these flare events. 
For example during the November 1960 disturbances, a few particles with Z > 10 
were detected and the flux of carbon nuclei was about one hundred times that 
during periods of normal activity. Perhaps of interest to space travellers were 
the results given by E. P. Ney and W. Stein who found for the same flare 
events that the radiation levels rose as high as 0·7 R h⁻¹.

The conference was brought to a close by C. F. Powell and B. Rossi who 
reviewed in turn ‘The Future Prospect’. Powell discussed the rôle to be played
by cosmic ray investigations into nuclear interactions at energies in excess of 
those available from machines and Rossi indicated that with the increasing use of 
satellites we may look forward to a rapid improvement of understanding of some 
of the geophysical problems of the cosmic radiation. 


The Author : 
Dr. P. L. Marsden is a lecturer in the Physics Department at the University of Leeds. 
His main interest is in the study of the cosmic radiation. 


Kyoto 1961. The International Conference on Magnetism 


by E. W. LEE 
University of Sheffield 


1. INTRODUCTION 

From September 25th to 30th, 1961, the International Union of Pure and
Applied Physics held an International Conference on Magnetism and
Crystallography in the magnificent new Kyoto Kaikan, the civic centre of the ancient
capital of Japan. About 850 delegates attended the conference of whom 600
were from Japan itself. During the five days some 320 papers were read in
three parallel sessions: 220 on magnetism and 100 on electron and neutron diffraction.
In addition two joint sessions were held at which studies of magnetic materials 
using neutron diffraction methods were reported. 

Japanese interest in magnetism originated solely with Sir James Alfred Ewing 
and his researches in magnetism as Professor of Mechanical Engineering at the 
University of Tokyo from 1878 to 1883. Afterwards independent studies were 
undertaken in Japan by Nagaoka, Honda, Kaya and many others whose writings 
on their researches in magnetism rapidly assumed the status of standard works. 
Today an enormous quantity of research covering virtually every branch of 
magnetism is being carried out at excellently equipped laboratories both in 
universities and in industry by an expanding group of young scientists whose 
enthusiasm for their subject was made apparent to every one who was present 
at the conference. 

In the following report a selection of papers has been discussed, some fairly 
fully. The choice of material, although aiming at a wide coverage,* nevertheless
cannot help reflecting the writer’s own interest somewhat and is to this extent 
unrepresentative. 


2. RARE EARTH METALS 

In recent years it has been possible to prepare pure samples of the rare earth 
elements, namely, those whose atomic numbers range from 57 to 71. Their 
unique properties arise from the possession of an unfilled 4f shell in their electron 
configuration. As the atomic number progresses from 57 (lanthanum) to 71 
(lutetium) the number of electrons in the 4f shell increases from 0 to 14 but the
outermost or valence shell configuration remains unchanged. The chemical
properties of these elements are thus virtually identical and many of their physical 
and metallurgical properties are quite similar. The magnetic properties are 
directly related to the number of 4f electrons and consequently differ appreciably 
from one element to another. 

At present the greatest interest attaches to the heavy rare earth metals 
from gadolinium (64) to thulium (69). These all crystallize in a simple hexagonal 
close-packed structure. Gadolinium is ferromagnetic below 17°c and its
properties seem to be in no way different from, say, cobalt. The remainder 
behave quite differently, becoming antiferromagnetic below a certain critical 
temperature (called the Néel temperature) and ferromagnetic at still lower 


* Some of the papers presented during two sessions headed “ Theory’’ have not 
received the attention they deserve. The writer felt, however, that those whose chief 
interests lie in theoretical developments were unlikely to be interested in an experimental 
physicist’s account of the wav he thinks the conjurers perform their magic. 


The Kyoto Magnetism Conference 223 


temperatures. ‘he magnetic structure in the antiferromagnetic phases has been 
determined by Wilkinson, Child, Koehler, Cable and Wollan (U.S.A.) using 
neutron diffraction. For dysprosium, holmium and terbium it has been 
established that the magnetic moments always lie in the basal plane perpendicular 
to the c-axis. In any given plane the moments are all parallel but the direction 
of the magnetic moment changes successively from one plane to another by a 
fixed angle forming a helical structure (fig. 1(b)). The pitch of this helix can 
be measured and is observed to be temperature-dependent. 

The structure of erbium is much more complicated. At 80°K the moments 
are aligned parallel to the hexagonal axis but vary sinusoidally in magnitude 
along this axis. Below 80° the neutron diffraction pattern indicates the presence 
of a helical structure with components of the moments perpendicular to the 
hexagonal axis. Finally at 20°K the structure becomes a ferromagnetic helix
(fig. 2).


Fig. 1. Diagram of magnetic orderings found by neutron diffraction studies in
terbium, dysprosium, and holmium. (a) Ferromagnetic, below the Curie temperature,
which is 218°K for Tb, 85°K for Dy, and 20°K for Ho. (b) Antiferromagnetic (helix),
between the Curie temperature and the Néel temperature, which is 230°K for Tb,
179°K for Dy, and 133°K for Ho.

Fig. 2. Magnetic ordering in erbium (panels: ferromagnetic helix; longitudinal
spin wave). Below 20°K it is ferromagnetic but with a helical structure having a
net moment along the hexagonal axis. At 80°K the neutron diffraction pattern
indicates that all the moments lie parallel to the hexagonal axis but vary in
magnitude with position (z) along this axis as sin ωz, where ω is a characteristic
wavelength.


Preliminary results for thulium suggest that at 53°K the moments are similar 
to those in the high temperature phase of Er; at 33°K there is evidence of a change 
also similar to that of Er. Thulium becomes ferromagnetic below 22°K. 

Some preliminary observations by the Oak Ridge group on the lighter rare 
earth metals have shown that neodymium (60) is antiferromagnetic below 4·2°K
and cerium (58) antiferromagnetic below 12·5°K. Sufficient data are available
to show that the antiferromagnetic ordering is quite different in the two cases 
but a precise determination of the structure is not yet possible. 

The behaviour of an antiferromagnetic helix in the presence of an external 
magnetic field has been investigated theoretically by Nagamiya, Nagata and 
Kitano (Japan). A field parallel to the screw axis merely turns all the magnetic 
moments through the same small angle towards the field. In the presence of a 
field perpendicular to the screw axis the helix first deforms slightly, but such that 


224 E. W. Lee 


the susceptibility is constant. If this field is increased a critical field is reached 
at which the moments which initially made a large angle with the field swing 
round so as to become almost parallel to it. Finally, complete saturation is reached 
at a definite field strength approximately twice the critical field. Thus the 
characteristic feature of the magnetic behaviour of substances possessing helical 
spin arrangements (the compound Au₂Mn is another) is that in weak fields they
exhibit antiferromagnetic behaviour; at high field strengths however they 
show the saturation effects that are normally associated with ferromagnetism. 
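
The reason a helical arrangement behaves antiferromagnetically in weak fields is that the moments of successive planes cancel over a full turn of the helix. The sketch below illustrates this geometrically; it is not a model of the field-dependent behaviour treated by Nagamiya and his colleagues. The function names and the turn angle of 30° are hypothetical choices made only for the illustration.

```python
import math


def helix_moments(n_planes, turn_angle_deg, moment=1.0):
    """Basal-plane components (mx, my) of the moment in successive planes.

    All moments within one plane are parallel; from one plane to the next
    the direction rotates by the fixed angle turn_angle_deg.
    """
    moments = []
    for k in range(n_planes):
        phi = math.radians(k * turn_angle_deg)
        moments.append((moment * math.cos(phi), moment * math.sin(phi)))
    return moments


def net_moment(moments):
    """Magnitude of the vector sum of the plane moments."""
    mx = sum(m[0] for m in moments)
    my = sum(m[1] for m in moments)
    return math.hypot(mx, my)


def pitch_in_planes(turn_angle_deg):
    """Number of planes needed for one complete turn of the helix."""
    return 360.0 / turn_angle_deg


if __name__ == "__main__":
    turn = 30.0  # degrees per plane; assumed for illustration
    planes = int(pitch_in_planes(turn))
    print("planes per turn:", planes)
    print("net moment over one turn:", net_moment(helix_moments(planes, turn)))
    print("net moment, zero turn angle:", net_moment(helix_moments(planes, 0.0)))
```

With a zero turn angle the same routine gives the simple ferromagnetic case, in which the plane moments add directly.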


3. METALS AND ALLOYS 

The magnetic properties of alloys are of interest, not only in themselves, 
but for the information which they may yield about their constituents. The
most powerful investigational methods employ a combination of purely magnetic 
measurements and neutron diffraction studies. However, this is not always 
possible and a surprisingly large amount of information can often be extracted 
from the magnetic or neutron diffraction data alone. 

Alloys containing manganese frequently exhibit unexpected properties of 
a quite bewildering complexity. Manganese in metallic systems appears to be 
capable of exhibiting a magnetic moment ranging from 0 to 5 Bohr magnetons
depending on the nature of the second metal. Moreover the interactions between
manganese atoms may be either antiferromagnetic or ferromagnetic. For example
copper-manganese alloys are frequently antiferromagnetic; ordered Cu,Mn is, 
however, ferromagnetic. α-Mn is antiferromagnetic but its complex crystal
structure precludes a unique spin assignment to the system. Sato and Arrott
(U.S.A.) suggest that one atom only on the primitive unit cell of 29 atoms
should be regarded as carrying the moment of 5μ_B. Two investigations on
gold—manganese alloys were reported. Jacobs, Kouvel and Lawrence (U.S.A.) 
measured the magnetization curves of Au₃Mn in pulsed fields up to
140 kilo-oersteds. They suggest that disordered Au₃Mn possesses a kind of
mixed ferromagnetic-antiferromagnetic state arising from ferromagnetic and
antiferromagnetic interactions between Mn atoms of different separation.
Ordered Au₃Mn is antiferromagnetic with a Néel temperature of about 150°K.
This observation was confirmed by Sato, Hirone, Watanabe, Maeda, Adachi, 
Kaneko and Kondo (Japan) who, however, find strong evidence for antiferro- 
magnetism in the disordered state with a Néel temperature of 50°k. 

Mn-Zn alloys have been investigated by Nakagawa, Sakai and Hori (Japan).
The phase diagram is complex but a hexagonal close-packed structure, stable
at high temperature, can be retained by quenching. Such alloys containing
15% (atomic) and 25% Mn are ferromagnetic. A face-centred cubic alloy
containing 27 at. % Mn annealed at 100°c becomes gradually ferromagnetic
at low temperatures; annealed at 250°c the same alloy appears to be
antiferromagnetic with a Néel temperature of 150°K.

Pure chromium is now known to be antiferromagnetic. The Néel temperature
is 37°c and the antiferromagnetism is suppressed by small amounts of impurity.
Shirane and Takei (U.S.A.) showed that at low temperatures (78°K) the spin
structure is that of a sinusoidal modulation of the magnetic moment exactly the
same as that observed in Er between 52° and 80°K (see §2). However at 120°K
Cr undergoes a transition involving a change in the direction of the spin ordering.
The precise structure above this temperature is not known for certain. A helical 
structure has been suggested. 




Alloys containing transition elements have always been a target for magnetic 
probing. Those involving uranium have been of interest since the discovery 
that the intermetallic compound UMn₂ is antiferromagnetic whilst UFe₂ is
ferromagnetic. This work has been extended by Hamaguchi, Komura, Kunitomi
and Sakamoto (Japan) who find that UNi₂ and UCo₂ are both paramagnetic
down to 150°K. However the magnetic moment of UFe₂ is small and corresponds
to only 1·13μ_B per molecule of UFe₂ (cf. 2·2μ_B for Fe). Neutron diffraction
investigations rule out the possibility of an antiferromagnetic arrangement and 
the authors conclude that both U and Fe atoms possess magnetic moments in 
this compound and that the spin arrangement is simply ferromagnetic. 

Studies on the Fe-Rh system were reported by Bertaut, Delapalme, Forrat, 
Roult, de Bergevin and Pauthenet (France). In the range from 50-55% Rh the 
alloys exhibit transitions between antiferromagnetism and ferromagnetism similar 
outwardly to that observed in some rare earth metals. However in the Fe-Rh 
alloys it is the antiferromagnetic phase which is stable at low temperatures and this 
suggests that such similarity should be treated with caution. In particular the 
Fe-Rh alloys are cubic and the absence of a crystallographic axis of symmetry 
precludes an antiferromagnetic screw axis such as is found in the rare earth metals. 


4. OXIDES AND OTHER COMPOUNDS 

In addition to the ferromagnetic metals and alloys there exists a large number 
of compounds whose properties are of considerable interest both from a 
fundamental standpoint and because of their potential technical applications. 

Blasse and E. W. Gorter (Netherlands) described the magnetic properties 
of spinels with compositions NiFe₂₋ₓVₓO₄. These are all ferrimagnetic and
for 0·6 < x < 0·7 exhibit a compensation temperature at which the magnetic
moments of the sub-lattices become equal and opposite and about which the
direction of the spontaneous total moment changes sign. The actual moments
are less than those to be expected from Néel’s theory and it is suggested that the 
moments within the octahedral sub-lattice cannot be parallel to one another. 

Distortions in spinels and other oxides were the subject of several papers. 
Normally the spinel structure is cubic but in compounds containing Cu²⁺ and
Mn³⁺ ions it undergoes a slight distortion to tetragonal. This is usually explained
in terms of the Jahn-Teller effect; if the orbital ground state of an ion is
degenerate in a cubic crystalline field the energy of the system can be reduced
by a crystal distortion. Such distortions exist in copper ferrite-aluminate
(Miyahara, Japan), in nickel chromite (Tsushima, Japan) and also in the system
MnₓCr₃₋ₓO₄ (Siratori and Iida, Japan). A purely magnetic distortion
(magnetostriction) in CoO and NiO single crystals has been extensively studied by
Nakamichi and Yamamoto (Japan). In these crystals the moments are 
ferromagnetically aligned within (111) planes but adjacent (111) planes are 
antiferromagnetically coupled. The spins are allowed by the crystal symmetry 
to take up several different orientations and form antiferromagnetic domains 
within the (111) planes. Application of a magnetic field causes movement of 
the domain boundaries and an associated magnetostriction which is of the same 
order as that found in ferromagnetic substances. 

Haematite, α-Fe₂O₃, and ilmenite, FeTiO₃, form a complete range of solid
solutions. This system has been the subject of an intensive study by Ishikawa
(Japan). If the system is denoted by Fe₂₋ₓTiₓO₃ then the most interesting
feature of the system is the appearance of strong ferrimagnetism in the limited
range of composition for which 0·6 < x < 0·9. Neutron diffraction studies show
this to be the result of ordering of Fe and Ti ions along the [111] rhombohedral
axis and with antiparallel coupling between spins on adjacent (111) planes,
but only for 0·6 < x < 0·8. No coherent magnetic scattering is observed in the
neutron diffraction patterns for x = 0·88 even at 4°K. Nevertheless, this
composition shows a considerable magnetic moment at this temperature. It is
suggested that the ferrimagnetic ordering exists only in clusters. Magnetic
measurements were made on specimens with x = 0·92, 0·88, 0·83 and 0·79. The
first three show superparamagnetic behaviour from which it is deduced that
for x = 0·92 and x = 0·88 each cluster contains 11 and 56 molecules respectively
of FeTiO₃.

Before leaving the subject of oxides mention should be made of an investigation 
of a somewhat new phenomenon, the so-called magneto-electric effect. There
are, in fact, two distinct effects: one is the appearance of a magnetic moment
which is proportional to an applied electric field, called for short the (ME)_E
effect, and the other is the appearance of an electric moment which is proportional
to an applied magnetic field, the (ME)_H effect. Both effects were anticipated
by Landau and Lifshitz, and Dzyaloshinskii predicted that they should be found
in Cr₂O₃. The first experimental observation of the (ME)_E effect was made by
Astrov. His measurements have now been repeated by Rado and Folen (U.S.A.)
and have been extended to different temperatures. These authors correlate the
two magneto-electric susceptibilities with the parallel and perpendicular
antiferromagnetic susceptibilities. In addition they report the first observations of
the (ME)_H effect.

The final papers in this session dealt with the magnetic properties of some 
sulphides and similar compounds. These exhibit a great diversity of behaviour 
partly because many can exist in non-stoichiometric form. The most familiar
example, and one of the most complex, is iron sulphide, commonly written FeS
but actually existing in various crystal forms over a continuous range of
compositions FeₓS from x = 0·915 to x = 1·01 and showing a wide range of
magnetic properties. The near-stoichiometric compounds are antiferromagnetic
(Néel temperature ~330°c) and have the NiAs structure. They develop a 
superlattice below 140°c which is apparently accompanied by a change in lattice 
dimensions and a sharp decrease in susceptibility. This decrease has been 
interpreted as resulting from a 90° rotation of the antiferromagnetic axis which 
is observed by neutron diffraction. Sparks, Mead and Komoto (U.S.A.) also 
observed the rotation by the same means but noticed that the temperature at 
which it takes place does not coincide with the ordering temperature. Further 
studies were carried out with samples for which x = 1·000, 0·998 and 0·994,
the accuracy of the iron to sulphur ratios being measured to about 0·05%.
Magnetic, neutron and X-ray diffraction investigations show that the sequence
of events on cooling (for the composition x = 0·996 studied in detail) is: the
substance becomes antiferromagnetic at the Néel temperature (~330°c) with 
the moments in ferromagnetic sheets normal to the c-axis with antiferromagnetic 
coupling between adjacent sheets. At about 185°c the moments gradually turn 
out of the basal planes—this is not accompanied by any significant change in 
susceptibility. At 142°c however, both lattice parameters exhibit abrupt changes 
resulting in a sudden decrease in cell volume, and this time large changes in
susceptibility are observed, which the authors suggest are due to a sharp increase
of anisotropy or exchange energy brought about by the sudden change in lattice 
parameter at the appearance of the superlattice. Thus there is no relation between 
the ordering temperature and the magnetic rotation temperature. However 
their closeness and their strong dependence upon minute changes in composition 
serve to explain the discordant and often contradictory results obtained by 
previous sets of workers on this system. 

With compounds of this type it is impossible to divorce the nature of the 
chemical compound from its crystal structure. In many cases it is clear that a 
compound can possess a certain crystal structure only if vacancies in the lattice 
exist and these vacancies, like the component ions, must be considered to exist 
in an ordered arrangement, change in which may be expected to alter the properties 
of the compound. Some measurements by Yuzuri, Kang and Goto (Japan) 
amply support this view. They measured the saturation magnetization of the
compound CrS₁.₁₇ after quenching from various temperatures up to 1000°c.
Such a procedure may be expected to leave the vacancies in a state of progressively 
increasing disorder. Although the crystal structure remains unchanged the 
spontaneous magnetization decreases by a factor 10 after quenching from 
1000°c. 

Another compound system which exhibits interesting properties which can 
be altered by changing the composition is Mn₂Sb. This is a normal ferrimagnetic
compound. However, if small amounts of chromium are added a first-order
transition from ferrimagnetic to antiferromagnetic ordering with increasing
temperature occurs at a temperature dependent on the amount of chromium
present. This work was reported by Jarrett, Cloud, Darnell, Bierstedt, Bither,
Walter and Swoboda (U.S.A.), who show that the principal effect of the Cr is
to introduce a slight contraction of the Mn₂Sb lattice until a critical lattice
parameter is reached at which the exchange interactions change sign. The
possibility that compounds might exist in which such ferrimagnetic-antiferromagnetic
transitions could take place was first suggested by Kittel about one year
before the Conference. In that short period these compounds have not only 
been discovered but appear to be remarkably well understood. 


5. ANISOTROPY AND RELAXATION 

The magnetic properties of single crystals of ferromagnetic metals and alloys 
are observed to depend on the direction within the crystal along which the magnetic 
field is applied. This effect is usually referred to, quite generally, as magnetic 
anisotropy. When a ferromagnetic substance is placed in a magnetic field its 
response is, in general, not instantaneous; instead the quantity which is being 
measured relaxes gradually towards a constant value, the rate of approach to 
equilibrium being determined by the nature of the relaxation process concerned. 


5.1. Metals 

In hexagonal cobalt the anisotropy energy may be expressed in the form
E = K₁ sin²θ + K₂ sin⁴θ, θ being the angle between the spontaneous magnetization
and the hexagonal axis. Pauthenet, Barnier and Rimet (France) have measured
the anisotropy constants K₁ and K₂ from 2 to 700°K. If K₁ > K₂, θ = 0 or π/2
depending on whether K₁ is positive or negative, giving either an axis or a plane
of easy magnetization. However, if 0 < −K₁/2K₂ < 1 the equilibrium angle in
zero field is given by sin²θ = −K₁/2K₂ so that instead of an axis or a plane there
exists a cone of easy magnetization with semi-angle θ. This condition is realized
in hexagonal cobalt between 518 and 598°K.
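
The three cases (easy axis, easy plane, easy cone) follow from minimizing the two-constant anisotropy energy with respect to the angle between the magnetization and the hexagonal axis. The sketch below is a minimal illustration in which the function names are hypothetical and the values of the constants are arbitrary numbers in arbitrary units, not the measured constants of cobalt. It compares the energy at the axis, at the basal plane, and at the stationary point sin²θ = −K₁/2K₂ when that point exists, and returns the angle of lowest energy.

```python
import math


def anisotropy_energy(theta, k1, k2):
    """E = K1 sin^2(theta) + K2 sin^4(theta), theta in radians."""
    s2 = math.sin(theta) ** 2
    return k1 * s2 + k2 * s2 ** 2


def easy_direction_deg(k1, k2):
    """Zero-field equilibrium angle (degrees) from the hexagonal axis.

    Returns 0 for an easy axis, 90 for an easy plane, and an intermediate
    value (the semi-angle) for a cone of easy magnetization.
    """
    candidates = [0.0, math.pi / 2.0]
    if k2 != 0.0:
        s2 = -k1 / (2.0 * k2)
        if 0.0 < s2 < 1.0:
            # Stationary point of E; it is a minimum only when K2 > 0,
            # which the energy comparison below takes care of.
            candidates.append(math.asin(math.sqrt(s2)))
    best = min(candidates, key=lambda t: anisotropy_energy(t, k1, k2))
    return math.degrees(best)


if __name__ == "__main__":
    # Illustrative constants in arbitrary units (assumed, not measured)
    print("K1=+1.0, K2=0.1 ->", easy_direction_deg(1.0, 0.1))
    print("K1=-1.0, K2=0.1 ->", easy_direction_deg(-1.0, 0.1))
    print("K1=-1.0, K2=1.0 ->", easy_direction_deg(-1.0, 1.0))
```

Comparing energies directly, rather than relying on the stationary-point formula alone, avoids mistaking an energy maximum (which occurs when K₂ is negative) for a cone of easy magnetization.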

Cubic cobalt, stable at high temperatures, transforms to the hexagonal form 
as it cools. The cubic form which is stable at room temperature only in the form 
of very fine particles can conveniently be obtained by precipitating the cobalt 
from a Cu-2% Co alloy. Rodbell (U.S.A.) measured the first anisotropy constant
for cubic cobalt by measuring the nuclear resonance frequency as a function of
orientation of a Cu-2% Co crystal and also of a thin (cubic) cobalt crystal
obtained by vacuum evaporation on to a MgO substrate. The temperature 
variation obeys the well-known tenth power law to surprisingly high temperatures. 

The ability of certain alloys to respond to the treatment known as magnetic 
annealing has been known for many years. However the observation early in 
1961 by Takahashi and Kono that pure (hexagonal) cobalt exhibits magnetic
annealing effects was somewhat unexpected. Some experiments by Graham 
(U.S.A.) show that the induced anisotropy is very large, about one-tenth of the 
single crystal value, and support the view that the phenomenon is closely connected 
with the cubic-hexagonal transformation that occurs about 700°K. It is suggested 
that the presence of a strong magnetic field at the transformation temperature leads 
to the formation of a crystallographic fibre axis which becomes frozen in as the 
sample cools. ‘The effect occurs because the magnetic anisotropy is large enough 
to be comparable with the small energy difference between the two phases. 

Slow relaxation processes are those involving atomic or ionic migration
between adjacent lattice sites. The nature of the migrant can often be identified
by the temperature at which the relaxation is observed. Dietze and Balthesen
(Germany) showed that a relaxation in silicon iron at 350 °C is due to the diffusion
of vacancies into the domain walls. The number of vacancies can be controlled
by bombarding the sample with neutrons.


5.2. Ferrites 

The magnetic anisotropy of ferrites is somewhat easier to understand than that
of metals since ferrites are essentially ionic compounds and one may, to a first
approximation, treat the ions in their different environments separately and
independently. To counteract this somewhat, ferrites show considerable
sensitivity in their electrical and magnetic properties to variations in composition,
deviations from exact stoichiometry, and variability of the valence states of the
magnetic ions.

The most noticeable characteristic of the anisotropy of cubic ferrites is its
sensitivity to small additions of cobalt. Bickford (U.S.A.) showed that the
hexagonal ferrites exhibit similar behaviour, showing an anisotropy which is
approximately proportional to the concentration of cobalt added. Measurements
reported by Smit, Lotgering and van Stapele (Netherlands) on the
anisotropy of single crystals of Ti-substituted manganese ferrites of composition
MnTiₓFe₂₋ₓO₄ indicate that additions of Ti produce an effect similar to that
of Co, namely that the anisotropy constants increase in proportion to the
concentration of Ti present. The interpretation, however, is somewhat different.
Since titanium replaces the ferric ions in the compound the concentration of
ferrous ions should be proportional to the total titanium content. It has been
known for some time that the presence of a small quantity of ferrous ions in the
technically important Mn-Zn ferrites increases their permeability at room
temperature to around 5000. The authors suggest that the ferrous ions give
rise to an anisotropy which is of such magnitude and sign as to annul that due
to the remainder of the compound; the Mn-Zn ferrites are known to have very
small magnetostriction and so a very high permeability results.

Ferrites, particularly those containing cobalt, exhibit magnetic annealing
effects. Studies of the kinetics of the annealing process by Iida and Inoue (Japan)
show that it is assisted by the presence of lattice vacancies. These workers
stress the connection between the ability of a substance to be magnetically
annealed, due to the formation of an anisotropic distribution of atom pairs brought
about by ionic migration at high temperatures in the presence of a magnetic field,
and magnetic viscosity due to the same ionic migration in and out of domain
walls. If the density of certain vacancies is reduced both effects are inhibited.
An almost parallel investigation by Perthel (Germany) with cobalt-substituted
nickel ferrite shows how difficult it is to draw general conclusions with these
substances. He finds that neither the state of oxidation nor the presence of
vacancies has any significant effect on the magnetic annealing process; it proceeds
more slowly than in the Fe-Co ferrites and appears to depend in magnitude only on
the cobalt ions themselves.


6. NUCLEAR RESONANCE IN METALS 

Since the first observation of nuclear resonance in cobalt by Gossard and 
Portis in 1959 the technique has been applied to other materials. From the 
nuclear resonance frequency one may determine the field acting at the nucleus 
(in iron for example the field comes out to be 330,500 oersteds at 295 °K) and
from the width of the resonance line one may learn something about the processes 
of nuclear relaxation in ferromagnets as discussed by Portis (U.S.A.). It appears 
that in both metals and insulators the main relaxation occurs through the transfer 
of energy from the nuclear spins to the electron spins (i.e. spin waves are excited 
via the hyperfine interaction) and ultimately to the crystal lattice. Gossard, 
Jaccarino and Wernick (U.S.A.) measured the Knight shift (the difference between 
the nuclear resonance frequency for the free atom and that when it forms part 
of a metal) of the ²⁷Al nuclear resonance in the rare earth compounds RAl₂
where R = La, Ce, ..., Lu. From the results it was deduced that the conduction
electrons must be polarized (i.e. magnetized) by the intense local field of the
rare earth ions. It turns out that the sign of the polarization is negative with
respect to that of the ions, i.e. there exists a strong antiferromagnetic interaction.
The magnetic properties of these compounds have been studied by Williams,
Wernick, Nesbitt and Sherwood (U.S.A.). The Curie temperatures of some of
them are surprisingly high; those of SmAl₂ and NdAl₂ are 122 °K and 68 °K
respectively. Moreover they are uniformly higher than those of the isomorphous
compounds Os₂R, Ir₂R and Ru₂R and it is suggested that this is due to differences
in the nature of the polarization of the conduction electrons in the different
systems. These workers have also studied the continuous series of solid solutions
GdₓPr₁₋ₓAl₂, which are ferrimagnetic. The magnetic
moments of Gd and Pr, although parallel within themselves, nevertheless are
always antiparallel to each other.

Although the number of nuclei with which resonance may be observed is not 
high, the potentialities of this technique are well illustrated by two papers dealing 
with alloys. Tsujimura, Hihara and Kushida (Japan) observed the nuclear
resonance of ®Cu in iron and cobalt. The results suggest that the field at a Cu 
nucleus is simply due to the 4s electrons polarized by the spins of the 3d electrons 
of the solvent atoms. Under high resolution the nuclear resonance is observed
to show a fine structure. In alloys this may be attributed to individual nuclear
resonance from atoms having different environments. Thus La Force, Ravitz
and Day (U.S.A.) observed several resonance lines in Fe-Co alloys. Relative
intensity measurements led them to account for the resonance as follows:

(1) at 278.8 Mc/s due to the ⁵⁹Co in an atom surrounded by eight nearest
neighbour iron atoms,

(2) at 282.3 Mc/s due to ⁵⁹Co in a nearest neighbour environment of seven
iron atoms and one cobalt atom,

(3) at 285.5 Mc/s from ⁵⁹Co surrounded by six Fe and two Co atoms,

(4) at 288.6 Mc/s (5 Fe and 3 Co).

It may be noticed that the nearest neighbour interaction is, to a good
approximation, simply additive. That is, if any nearest neighbour iron atom
is replaced by cobalt then the resonance frequency of the central ⁵⁹Co nucleus
is shifted to a higher frequency by about 3-4 Mc/s. The authors conclude that
these results favour a model of ferromagnetism in which the unpaired d electrons
are well localized.
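
The additivity can be read directly from the four line positions listed above. The short sketch below simply takes successive differences of those frequencies; the variable names are illustrative and nothing beyond the four quoted values is assumed.

```python
# Resonance frequency (Mc/s) of the central Co nucleus, keyed by the number of
# cobalt atoms among its eight nearest neighbours (values as listed in the text).
lines_mc_per_s = {0: 278.8, 1: 282.3, 2: 285.5, 3: 288.6}

# Shift produced by each successive replacement of an iron neighbour by cobalt.
shifts = [
    round(lines_mc_per_s[n + 1] - lines_mc_per_s[n], 1)
    for n in range(len(lines_mc_per_s) - 1)
]
mean_shift = sum(shifts) / len(shifts)

print("successive shifts (Mc/s):", shifts)
print("mean shift (Mc/s): %.2f" % mean_shift)
```

Each replacement moves the line by between 3 and 4 Mc/s, which is the sense in which the interaction is described as nearly additive.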


7. MÖSSBAUER EFFECT

The Mössbauer effect, the resonant absorption of nuclear γ-rays, may be used
to study the hyperfine structure of ⁵⁷Fe in magnetic materials. Experiments
are usually performed by observing the absorption by stable ⁵⁷Fe nuclei of the
14.4 keV γ-ray from a source containing radioactive ⁵⁷Fe. On account of the
large magnetic field at the nucleus the absorption line shows marked Zeeman
splittings and from a study of the Zeeman spectrum the magnetic field at the
nucleus can be determined. To this extent the measurements yield information
complementary to that given by nuclear resonance studies but the Mössbauer
effect can also be used to obtain information concerning the electric-field-gradient
tensor at the nucleus. This is beyond the scope of the nuclear resonance
investigations because ⁵⁷Fe does not possess a nuclear quadrupole moment.
Bauminger, Cohen, Marinov and Ofer (Israel) found that the effective magnetic
fields at the ⁵⁷Fe nuclei in yttrium iron garnet at room temperature are 3.90 × 10⁵
oersted and 4.85 × 10⁵ oersted for tetrahedral and octahedral sites respectively.
At liquid air temperature these fields increase to 4.6 × 10⁵ oersted and 5.4 × 10⁵
oersted respectively. The fields in dysprosium iron garnet were investigated
using a source containing ¹⁶¹Tb, the absorber containing ¹⁶¹Dy. At 85 °K the
effective field is 3.5 × 10⁶ oersted and falls by a factor of 4.6 at 300 °K. This is
just the factor by which the dysprosium sub-lattice magnetization changes
between the two temperatures and clearly indicates the source of the field.

An extremely interesting application of the Mössbauer effect was reported
by both the Israel group and by Ono, Ishikawa and Ito (Japan). It concerns
magnetite. This is a ferrimagnetic spinel with two types of cation sites: A-sites
with tetrahedral oxygen coordination containing Fe³⁺ ions and octahedral B-sites
containing Fe²⁺ and Fe³⁺, i.e. it is an inverse spinel. However it exhibits a very
unusual transition at 120 °K which Verwey suggested many years ago was an
electronic order-disorder transition such that above 120 °K there is a fast
electron exchange between Fe²⁺ and Fe³⁺ in the B-sites whereas below 120 °K
the two types of ions are separately ordered. This hypothesis, although generally
accepted, has so far defied direct proof. Both sets of workers agree that below
120 °K the field at the Fe³⁺ ion on a B-site in magnetite is about 5.1 × 10⁵ oersted,
the same as it is in nickel ferrite and γ-ferric oxide. Moreover as the temperature
is raised through the transition temperature the complexity of the Mössbauer
spectrum (which is very considerable) decreases. This evidence of an effective
equalizing of the moments on the B-sites suggests that Verwey's hypothesis is
essentially confirmed. But it is difficult to identify all the Mössbauer lines
unambiguously and further experiments are in progress.


8. MAGNETISM AT LOW TEMPERATURES 

Two whole sessions were devoted to magnetic phenomena specifically 
associated with temperatures in the liquid helium range. 

Recent Russian work on piezomagnetism was described in some detail by 
Borovik-Romanov, Aleksanjan and Rudashevskii. This is the magnetic analogue 
of the piezo-electric effect and is the appearance of a magnetic moment when a 
uniaxial stress is applied to a crystal in the absence of an external field. The 
effect was investigated theoretically by Dzyaloshinskii who showed that certain 
antiferromagnetic crystals having a tetragonal or rhombohedral lattice should 
exhibit it. It was first observed experimentally in CoF₂. New results were
presented for FeCO₃ which shows a piezomagnetic effect at 20 °K when the
specimen is cooled under stress; without stress the effect is much smaller and
this is attributed to the presence of antiferromagnetic domains. The second
part of their paper described an investigation of the piezomagnetic effect in CoF₂
using ultrasonic waves as a means of stressing the crystal. The temperature 
dependence of the effect follows closely the temperature dependence of the 
sub-lattice magnetization. 

If ZnF₂ is added to the antiferromagnetic fluorides MnF₂ and CoF₂ the
zinc ions replace the transition metal ions and effectively dilute them. Since 
antiferromagnetism, like ferromagnetism, is a cooperative effect requiring the 
active participation of several nearest neighbours one would expect to observe 
an abrupt disappearance of antiferromagnetism when a certain critical dilution 
is reached. This was confirmed by some measurements reported by Baker, 
Lourens and Stevenson (G.B.). 

An ideal paramagnetic substance is one that obeys Curie’s law. However 
at sufficiently low temperatures all paramagnetic materials show departures 
from ideal behaviour because of interactions between the magnetic ions. 
Normally the dominant effect is exchange interaction but in salts containing 
rare earth ions in which the exchange interaction is weak there are grounds for 
believing that the dominant interaction is purely magnetic in origin and is 
predominantly dipolar in kind. Extensive studies on such salts were reported 
by the Oxford group. Cooke, Edmonds, Finn and Wolf (G.B.) studied the 
rare-earth ethyl sulphates paying special attention to dysprosium ethyl sulphate. 
The peculiar simplifying feature of this salt is that the dysprosium ion is so 
situated that the crystal field splits the free-ion ground state in such a way that 
there is a very large g-value (10.8) parallel to the symmetry axis of the crystal
and a very small, almost negligible, one perpendicular to it. Fig. 3 shows the
observed susceptibility as a function of temperature. The solid curves represent
theoretical calculations for purely magnetic interactions of an increasing degree
of refinement. Similar agreement between the calculated and observed entropy
curves confirms the view that magnetic interactions are dominant. Below 0.13 °K
a transition to ferromagnetism occurs, the salt showing domain and hysteresis
effects as well as pronounced relaxation effects which make the susceptibility
field- and frequency-dependent. Analogous investigations using GdCl₃ were
described by Wolf, Leask, Mangum and Wyatt (G.B.). This substance has
a simple hexagonal structure in which the Gd³⁺ is magnetically isotropic. It
shows a transition to ferromagnetism at 2.20 °K, a temperature too high for the
transition to be due solely to dipolar interactions and it is concluded that exchange
interactions must be significant. However the dipolar interactions can be
calculated and their contribution subtracted from those due to exchange. In
particular, the observed anisotropic effects must be solely due to dipolar forces
since the exchange forces are isotropic; the anisotropy energies calculated on
this basis are in good agreement with those observed by experiment. The
magnetic ions are considerably more densely packed in this compound than in
the ethyl sulphate previously discussed and this, together with the large moment
of the magnetic ion, gives rise to a saturation magnetization at 0 °K of about
660 e.m.u./cc. This is more than one-third that of iron.

Fig. 3. Susceptibility of dysprosium ethyl sulphate as a function of temperature shown by
plotting 1/T* against 1/T (in °K⁻¹). The circles represent experimental points.
The lines labelled 1-4 represent theoretical curves of increasing mathematical
refinement: (1) Molecular field; (2) Van Vleck expansion; (3) Ising model with
nearest-neighbour interactions; (4) Ising model with nearest-neighbour and
next-nearest-neighbour interactions. (Reproduced by kind permission of Dr. W. P. Wolf.)


The magnetism of the compound cobaltous fluosilicate, Co(H₂O)₆SiF₆, is
due to the Co²⁺ ion. However in this case exchange interactions are predominant.
According to Ohtsubo and Kanda (Japan) the salt is antiferromagnetic at 1 °K
with a weak ferromagnetism superimposed. Similar behaviour was observed
in the corresponding manganous salt. In both cases the weak ferromagnetism
disappears at the Néel temperature and is probably of the same origin as that
which occurs in γ-Fe₂O₃.

Several papers dealt with the appearance of antiferromagnetism in salts at 
low temperatures. Vivianite, Fe₃(PO₄)₂·8H₂O, studied by Poulis and van der
Lugt (Netherlands) using the nuclear resonance of the phosphorus ion, appears
to be paramagnetic at liquid hydrogen temperature but becomes antiferromagnetic
at 4 °K. Sawatzky and Bloom (Canada) used a similar technique to investigate
CoCl₂·6H₂O which has a paramagnetic-antiferromagnetic transition at 2.28 °K.

Specific heat measurements of transition-metal chlorides have yielded 
valuable information concerning magnetic transitions. FeCl₂·4H₂O, investigated
by Friedberg, Cohen and Schelleng (U.S.A.), exhibits a peak in the specific heat
at 1.6 °K suggesting a paramagnetic-antiferromagnetic transition, and also a
Schottky anomaly at about 3 °K due to the splitting of the ground state of the
Fe²⁺ ion by the crystal field. The specific heats of anhydrous CuCl₂ and CrCl₂
have been determined by Stout and Chisholm (U.S.A.) over the temperature
range 11 °K to 300 °K. The susceptibilities of these compounds have been
measured previously and show maxima at 70 °K and approximately 40 °K
respectively, suggesting the appearance of antiferromagnetism at these temperatures.
However the specific heats show peaks at 23.91 °K and 16.06 °K respectively.
This apparent discrepancy may be related to the crystal structures; both 
compounds have long chains of the form 


[Structural diagram: a chain of metal ions M bridged by chlorine ions Cl.]


in which the metal ions are linked by the closer chlorine ions. 

The authors show that the main interaction between M ions is a superexchange
via the chlorine ions, with a weaker interaction between metal ions belonging to 
neighbouring chains. It is suggested that the peaks in the specific heat mark 
the beginning of long-range antiferromagnetic ordering between M ions in 
adjacent chains whereas the maximum in the susceptibility arises from the 
interactions within the chains. 


9. FERROMAGNETIC RESONANCE 

The first of two sessions devoted to ferromagnetic resonance was preceded 
by a group of papers dealing with rare earth iron garnets, chiefly yttrium.
The importance of these compounds was reviewed by Van Vleck (U.S.A.).
In these materials the interaction between rare earth ions, and the reaction of 
the rare earth ions on the iron sub-lattice may both be disregarded. The coupling 
between the iron sub-lattice and the rare earth ions may be treated in terms of a 
molecular field acting on the rare earth ions alone, which is taken to be proportional 
to the magnetization of the iron. This can be determined by measurements on 
yttrium iron garnet (YIG) in which the rare earth ions are replaced by non- 
magnetic yttrium. One can thus study the magnetization of rare earth ions in 
high fields, without the necessity of constructing large magnets. Frequently one
starts with pure YIG and then adds a small percentage of rare earth ions. Pearson
and Cooper (G.B.) studied the magnetic anisotropy associated with the rare earth
ions in this way. The anisotropy may be calculated from a knowledge of the energy
level splitting in the crystal field but the values obtained in this way are not in
very good quantitative agreement with the measured values.

One of the most important items of information yielded by ferromagnetic
resonance is the spectroscopic splitting factor $g$, defined by the relation
$\omega = g(e/2mc)H$, where $\omega$ is the resonance frequency in a field $H$, $e$ and $m$ are
the charge and mass of the electron and $c$ is the velocity of light. At one time
it was thought that this quantity should be the same as the quantity $g'$ equal to
the ratio of the magnetic to mechanical moment and which may be determined from
the angular momentum which a body acquires when it is magnetized (Einstein-de
Haas effect). However Kittel and Van Vleck showed that these quantities do,
in fact, represent different things and, instead of being equal, obey the relation
$1/g + 1/g' = 1$. The numerous determinations of $g$ from ferromagnetic resonance
have led to an increased interest in precise values of $g'$ and Scott (U.S.A.)
described an extensive investigation aimed at repeating and extending the older
direct measurements of the magnetomechanical ratio using the Einstein-de Haas
effect. It appears that the Kittel-Van Vleck relation has been verified, at least
within the (quite small) limits of experimental error. At the Grenoble conference
three years previously attention was drawn to the anomalously low value of $g'$
obtained in 1935 by Coeterier for pyrrhotite, namely $g' = 0.63$, and it was suggested
that a reinvestigation should prove rewarding as this value does not fit in with
any of the current theories of this substance. Scott, who now finds $g' = 1.9$,
suggests that a possible reason for the low value obtained by Coeterier lies in
the highly anisotropic nature of the material which makes it difficult to avoid
spurious torques.
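
The two relations quoted above lend themselves to a short numerical check. The sketch below is illustrative only: the function names are hypothetical, the physical constants are rounded Gaussian (c.g.s.) values supplied here rather than taken from the report, and the sample inputs are arbitrary.

```python
import math

# Rounded Gaussian (c.g.s.) constants; assumed values, not from the report.
E_ESU = 4.803e-10   # electron charge, e.s.u.
M_E = 9.109e-28     # electron mass, g
C = 2.998e10        # velocity of light, cm/s


def resonance_frequency_hz(g, field_oersted):
    """Frequency f = omega / 2*pi from omega = g (e / 2mc) H."""
    omega = g * (E_ESU / (2.0 * M_E * C)) * field_oersted
    return omega / (2.0 * math.pi)


def g_prime_from_g(g):
    """Solve the Kittel-Van Vleck relation 1/g + 1/g' = 1 for g'."""
    return g / (g - 1.0)


# For g = 2 the resonance frequency comes out near 2.8 Mc/s per oersted.
print("f per oersted for g = 2: %.3e c/s" % resonance_frequency_hz(2.0, 1.0))
# A g slightly above 2 corresponds to a g' slightly below 2.
print("g' for g = 2.1: %.3f" % g_prime_from_g(2.1))
```

Only when $g = 2$ exactly do the two factors coincide; for any other value one lies above 2 and the other below it.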

Considerable attention is being paid to the problem of the width of the 
ferromagnetic resonance line. Lin and Neaves (U.S.A.) studied the resonance 
of fine nickel powders with a view to studying the local field acting on each 
particle. They conclude that a major cause of line broadening is thermal
fluctuation in the direction of the magnetization in the finer particles. Morrish and
Valstyn (U.S.A.) attempted an even more complex problem, that of determining
the distribution of the shapes of fine particles of γ-Fe₂O₃.

One major problem in ferromagnetic resonance has been the width of the 
resonance line in pure materials. Experiments habitually yielded values of 
some hundreds of oersteds whereas intrinsic relaxation processes considered 
theoretically obstinately refused to broaden the line by more than about one 
oersted. With the discovery of yttrium iron garnet for which the line width, 
initially observed to be about 10 oersted, was later reduced to less than 1 oersted, 
it became clear that a material was available for the first time in which intrinsic 
relaxation processes could be studied. Theoretical progress on this topic was 
reviewed by Kittel (U.S.A.), and the present experimental situation was summed 
up by LeCraw and Spencer (U.S.A.). It appears that the major contributions
to the line breadth in YIG come from surface roughness of the sample and the
presence of small quantities of rare earth and other impurities. When these are
eliminated line widths as small as 0.15 oersted can be obtained.

Before leaving the subject of ferromagnetic resonance proper mention should 
be made of an interesting new technique proposed and described by Schlömann
(U.S.A.) and termed the parallel pumping technique. In a normal resonance
experiment the d.c. and microwave fields are perpendicular. However it has 
been shown that even when these fields are parallel a ferromagnetic material 
is able to absorb power from the microwave field. The absorption is very small 
at arbitrarily low power levels and has not been detected under these conditions. 
However, as the amplitude of the microwave field is increased the absorption 
becomes non-linear and instabilities are observed when the microwave field 
exceeds a well-defined threshold corresponding to the generation of pairs of 
spin waves of equal and opposite wave number and frequency half that of the 
microwave field. The threshold field for instability can be varied by altering the 
d.c. field. This technique promises to be a very powerful method and like all 
spin wave resonance methods may be used to obtain values of the exchange 
constant. Values determined in this way are in good agreement with those 
determined from low temperature specific heat data. 


10. THIN FILMS

The behaviour of thin magnetic films is of great interest; their bi-stable 
properties (the two stable states being positive and negative remanence) permit 
them to be used as the elements in memory units of automatic computers. They 
are usually vacuum-deposited and are commonly about 1000 Å thick. Deposition
in a magnetic field makes them uniaxially anisotropic with the easy direction
parallel to the field direction; the perpendicular direction in the plane of the
film is a hard direction. Thus formed, the B-H loop in the easy direction is
almost perfectly rectangular, a desirable feature for a memory unit; the B-H
loop in the hard direction shows hardly any hysteresis and is linear almost up to
saturation which is achieved in a field $H_K$, known as the anisotropy field.* The
precise origin of this uniaxial anisotropy is still a major problem. From a study
of nickel-iron films in the range 50-100% Ni it has been established that most
of the anisotropy must be due to the preferential ordering of pairs of Ni and Fe
atoms parallel to the field applied during deposition, with an additional (usually
smaller) contribution arising from a magneto-elastic energy due to the presence
of an anisotropic stress coupled with the magnetostriction (which depends on
the direction of $I_s$ and therefore gives rise to an energy which depends on the
direction of $I_s$). Strong circumstantial evidence for the latter mechanism arises
from the fact that the uniaxial anisotropy goes through a minimum at the 
composition at which the magnetostriction is observed to be zero. However 
although anisotropic stresses in evaporated films have been observed their origin 
remains obscure. 
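
The footnote relation between the anisotropy field and the anisotropy constant, and the linear hard-direction loop described above, both follow from a short energy minimisation. The sketch below assumes coherent rotation of the magnetization in a single-domain film with the field $H$ applied along the hard direction in the plane; these assumptions, and the symbol $I_{\parallel}$ for the magnetization component along the field, are introduced here for illustration.

```latex
E(\theta) = K\sin^2\theta - H I_s \sin\theta ,
\qquad
\frac{\partial E}{\partial\theta} = \cos\theta\,\bigl(2K\sin\theta - H I_s\bigr) = 0
\;\Longrightarrow\;
\sin\theta = \frac{H I_s}{2K} \quad (H < 2K/I_s),
\\[4pt]
I_{\parallel} = I_s \sin\theta = \frac{I_s^{2}}{2K}\,H ,
\qquad
I_{\parallel} = I_s \ \text{when}\ H = H_K = \frac{2K}{I_s} .
```

The magnetization component along the field is therefore proportional to $H$ until saturation is reached at $H_K$.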

Recent discoveries which indicate the inadequacy of a simple explanation such 
as that outlined above were listed by Smith (U.S.A.). Among these are: 

(1) The direction of the easy axis in thin films varies by a few degrees from 
one part of the film to another. 

(2) Films frequently exhibit small but well-defined regions
in which the easy axis is at right angles to that in the bulk of the film.

The origin of such regions is unknown and although a definite correlation
with the magnitude of the magnetostriction has been found, they cannot be
simply due to regions of strong local stresses since other regions would then be
expected to exist in which the local easy direction would be at an intermediate
angle, say 45°, to that of the bulk of the film; such regions are not observed.

* The uniaxial anisotropy energy may be written $E_K = K \sin^2\theta$ where $\theta$ is the angle
between the magnetization $I_s$ and the easy axis, and $K$ is the anisotropy constant. It may be
shown that $H_K = 2K/I_s$.

In addition to the difficulties mentioned above it is found that the magnitude 
of the anisotropy field is dependent upon (1) alloy composition, (2) substrate 
material, (3) substrate temperature, (4) evaporation rate, (5) crucible material. 
An ingenious theory of the mechanism of formation of the uniaxial anisotropy 
was put forward by Prosen, Holmen, Gran and Cebulla (U.S.A.). The central 
feature is the formation of NiO at grain boundaries due to the presence of slight 
traces of oxygen in the evaporation chamber. Theories of this kind received a 
blow (and many experimenters without access to such advanced equipment 
breathed a sigh of relief) from a basic contribution by Graham and Lommel 
(U.S.A.) who deposited nickel films in very high vacua. They find that the 
pressure during the evaporation of nickel films has no effect on the magnetic 
properties, at least over the range of pressure from 10-° mm Hg (this is the 
pressure at which films are commonly deposited and at which the oxygen 
concentration might be appreciable) to 10-°mm Hg. Such films exhibit 
magnetic annealing whereas pure nickel foils of comparable thickness prepared 
by cold-rolling do not. It is concluded that the magnetic annealing of vapour 
deposited films must in some way be connected with their internal structure. 

The suggestion made by Kittel at the previous conference at Grenoble in 
1958, that standing spin waves may be directly excited in thin magnetic films by 
microwave resonance has been confirmed by several experimenters. Physically, 
one observes a number of microwave power absorption peaks caused by excitation
of the magnetization in normal modes of the form $\sin(n\pi z/L)$ where $n$ is an odd
integer and $L$ is the film thickness, analogous to the normal modes of vibration
of a stretched string fixed at both ends. The importance of such measurements 
lies in the fact that they yield direct values of the exchange coupling constant. 
This information is not otherwise readily obtainable except from magnetization 
measurements close to absolute zero. 
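
How the exchange constant is extracted from the positions of the absorption peaks can be indicated as follows. This is a sketch of the usual treatment, assuming the static field is applied normal to the film and the surface spins are pinned; the symbols $A$ (exchange constant), $\gamma$ (gyromagnetic ratio) and $H_n$ (field at which the $n$-th mode resonates at fixed frequency $\omega$) are notation introduced here and do not appear in the report.

```latex
k_n = \frac{n\pi}{L},
\qquad
\frac{\omega}{\gamma} = H_n - 4\pi I_s + \frac{2A}{I_s}\,k_n^{2}
\;\Longrightarrow\;
H_1 - H_n = \frac{2A}{I_s}\left(\frac{\pi}{L}\right)^{2}\bigl(n^{2}-1\bigr).
```

Under these assumptions the peak separations scale as $n^2$, and a plot of $H_n$ against $n^2$ gives $A$ from the slope once $I_s$ and $L$ are known.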

A completely new thin-film method of measuring the exchange constant was 
described by Methfessel and Thomas (Switzerland). It is based on the response 
of the magnetization to an inhomogeneous magnetic field, conveniently provided 
by passing a current through a thin film or along a thin cylindrical wire. Results 
obtained with films of composition 80-20% Ni-Fe are in good agreement with 
those obtained by other methods. 


11. TECHNICAL MAGNETIZATION

The enormous progress in technical magnetization that was made just after 
the war has now slowed down considerably and a period of consolidation and 
steady improvement in the properties of technical materials is with us. 

A paper by Paulus and Guillaud (France) provides a good illustration of this. 
These authors have carefully studied the way in which the magnetic properties
of a polycrystalline ferrite depend upon the size of the grains. When the grain
size is very small the initial magnetization arises by coherent rotation of the 
magnetization within single domain crystals. When the grain size is larger and 
the polycrystalline aggregate is dense magnetization occurs by the normal process 
of displacement of domain boundaries. If the density is low, however, the voids 
which are present provide locking points for the domain boundaries and
magnetization is then due to a process involving bulging of the domain walls.
These observations enable a qualitative interpretation to be made of the effects 
of firing temperatures on the low field properties of ferrites. Manganese zinc 
ferrites have considerable technical importance because of their magnetic softness. 
However, the best magnetic properties are obtained when the material 
is very pure and then it is found that the resistivity is low and consequently 
eddy current losses are very high. Since these ferrites are usually operated in 
alternating fields this is a serious limitation. It can, however, be overcome by 
adding small quantities of CaO; autoradiograph studies and measurements of 
the frequency dependence of the resistivity show that the calcium combines in 
an unknown way with the ferrite and segregates out in the intergrain boundaries 
as a high resistivity material. 

The most direct methods of studying magnetization processes are those which 
permit the domains themselves to be observed. Szczeniowski and Wrzeciono 
(Poland) used the Bitter technique to study domain wall movement in the 
compound Mn₅Ge₃. Despite its large uniaxial anisotropy, comparable with that
of cobalt, the coercive force of single crystals is less than 1 oersted. This
technique is not readily adaptable for temperatures other than room temperature
and Kojima and Goto (Japan) employed the Faraday effect to study the
temperature variation of the domain width in thin layers of the hexagonal
compound BaFe₁₂O₁₉ up to 250 °C. A less direct method, the analysis of
magnetothermal measurements was used by Bates and Pacey (G.B.) who 
obtained for the first time quantitative evidence for domain nucleation processes 
at fields near the coercive force. 

In the second half of the session devoted to high coercivity materials Dietze 
(Germany) outlined a new theory of the coercive field. Good agreement with the 
observed values for the coercive force of iron containing precipitated copper is 
claimed. Gaunt and Silcox (G.B.) reported electron microscope measurements 
of the precipitated particles when a 1.5% Co in Au alloy is heat-treated after
being quenched from 1000 °C. It is suggested that the very high coercivity
(~2000 oersted) can be explained only by assuming that the precipitated particles,
which are in the form of discs, consist of the highly anisotropic hexagonal (as 
opposed to the more usual f.c.c.) cobalt. 


12. PHYSICAL PROBLEMS OF ROCK MAGNETISM 

It is now generally accepted that measurements of the remanent magnetism 
of rocks can be used to study events in the geological past. Clear indications of 
polar wandering were provided by the older data; more recently evidence has 
been accumulated which strongly supports the theory of continental drift. 
It is, of course, essential to understand fully the behaviour of the magnetic 
constituents of the rocks most frequently studied. Very often these constituents 
(basically FeO, Fe₂O₃ and TiO₂) are antiferromagnetic and are present in the 
form of very fine particles. Their behaviour has been studied theoretically by 
Néel (France). Since the spins in many antiferromagnetic substances are 
ordered ferromagnetically in parallel sheets with antiparallel ordering between 
adjacent sheets it is clear that a sample of such a substance will have zero magnetic 
moment only if it possesses an even number of sheets. In general this will not 
be the case, and there is a real likelihood that a very small particle will possess a 
permanent magnetic moment. An assembly of such particles will exhibit 
superparamagnetism and the average magnetization, defined as magnetic moment 
per gram or per cc, will increase as the particle size decreases. These predictions, 
first observed by Creer, have been confirmed by Cohen, Creer, Pauthenet and 
Srivastava (France) using NiO powders of 204 and 400 Å diameter and similar 
powders of Cr₂O₃ and α-Fe₂O₃. 

When a specimen is cooled through its Curie temperature in a magnetic field 
the direction of the remanent magnetization at room temperature is sometimes 
found to be opposite to that of the applied field; this is then termed the reverse 
thermo-remanent magnetization. Since this is the way in which most igneous 
rocks will have acquired their magnetization it is vital to know which rocks show 
the reverse T.R.M. and which rocks do not in order to settle the question of a 
reversal of the earth’s field during geological time. Ishikawa and Syono (Japan) 
have studied this phenomenon in the FeTiO₃ system. If this is denoted by 
xFeTiO₃-(1−x)Fe₂O₃ then only those compositions for which 0.7 > x > 0.4 
show the effect. This is in fact the range of composition over which order-disorder 
transitions occur. The results show that the appearance of the reverse T.R.M. 
is associated with the presence of a metastable phase in which the direction of 
magnetization does not reverse even in fields as strong as 30000 oersted. This 
points to some form of exchange anisotropy but the precise mechanism is not 
yet known. At present it appears that the phenomenon of reverse T.R.M. is 
fairly rare; its existence has naturally made interpretation of field data a little 
less certain but upholders of the belief that the earth’s field has reversed several 
times in the geological past have, so far, successfully withstood all assailants. 


13. CONCLUSIONS 

The immediate impression gained from the Kyoto Conference is of the 
numerous and very significant advances which have taken place since the previous 
conference at Grenoble in 1958. New techniques have been applied and new 
results obtained. The range of new materials is very much wider than before 
and some of these exhibit effects whose existence was not suspected previously. 

On looking through the list of papers presented one is struck by the multiple 
authorship of almost all of them. This ever-growing trend reflects the increasing 
complexity of the subject and the diversity of experimental and theoretical 
techniques required to solve its problems. Although there will always be a place 
for the individualist a greater and greater fraction of investigational work in 
magnetism is being carried out by research teams and the opportunity for 
scientists to interchange ideas is becoming more vital than ever before. All 
those who attended the conference will agree that it amply provided such an 
opportunity. Our warmest thanks are extended to all who made the conference 
possible, in particular to the Organizing Secretary, Professor T. Nagamiya, 
but above all to our Japanese hosts who, with their almost overwhelming 
hospitality, transformed the visit of every foreign delegate into an unforgettable 
experience. 


The Author : Dr. E. W. Lee is Senior Lecturer in Physics in the University of Sheffield. 


The 46th Annual Exhibition of The Institute of 
Physics and The Physical Society - 15—19 January, 1962 


Universities, colleges, and teaching hospitals have this year made a significant 
contribution; some with single exhibits of apparatus, such as the vast assembly 
for recording the multiple scattering of particle tracks in nuclear emulsions 
(University College, London) and the ultra-sound image camera which displays 
a picture obtained in 4 Mc/s sound waves on a television tube (Northampton 
College of Advanced Technology and University College Hospital); and some 
with larger displays, such as the Cavendish Laboratory, Cambridge, which 
included some apparatus for lecture demonstration and class use. The Institute 
and Society itself has branched out too; and on a second stand the Education 
Committee had a display of apparatus, some from the United States and some 
developed in this country, for teaching modern physics, and also elementary 
physics on modern lines. 

Next, the DSIR Research Associations. The use of microwaves for measuring 
the moisture content of materials (Building Research Station), and of 
β-particles for examining tree sections (Forest Products Research Laboratory) 
show the rapid progress of modern techniques into this field. The National 
Physical Laboratory had a demonstration on the generation of coherent light in 
an optical maser, and equipment for the measurement of spin-spin and 
spin-lattice relaxation times. The British Iron and Steel Research Association had 
a curve-scanner, which (by departing from simple x—y scanning) may be used 
for the recognition of printed characters. The aim is to develop a device that 
can ‘read’ normal handwriting, and translate this into computer language; 
so people may soon have to discipline themselves to write it. 

The Royal Radar Establishment demonstrated the development of very 
pure indium antimonide as an infra-red detector, and its use in the millimetre 
range (at 1.5°K), and gave examples of the results obtained from various plasma 
sources including ZETA. It also showed an example of the new ‘fibre optics’ 
technique, in which end-on stacks of glass fibres act as light guides between 
the phosphor of a cathode-ray tube and a photographic plate, enabling direct 
contact exposure to be used. The Meteorological Office showed a number of 
radio-sondes, and also the ozone equipment which will be included in the load 
of the next British satellite. 

The Safety in Mines Research Establishment had a rescue breathing 
appliance using liquid oxygen, which keeps the wearer at a reasonable 
temperature. 

Among the exhibits from firms, it is often difficult for the ordinary physicist 
to spot what is new in principle, as distinct from technical advances. In the 
former class came the emission of ‘hot’ electrons from cold semiconductors 
in a high electric field (Hirst Research Centre, the GEC); the 
measurement of angular velocity by means of Coriolis forces acting on a vibrating 
system (S. Smith & Sons, England, Ltd.); and an interference microscope 
objective (W. Watson & Sons). The latter class included fission chambers 
for the measurement of neutron flux at high temperatures within a reactor 
(20th Century Electronics, Ltd.), a multilayer image intensifier (English Electric 
Valve Co. Ltd.), and an ion rocket motor (Elliott Brothers, Ltd.). 


Harwell Reactor School: Science Masters Course No. 2 


A course will be held for Science Masters from 9th to 13th April, 1962, comprising 
lectures, practical work, demonstrations and visits. The course will cover some recent 
applications of nuclear physics, particularly those at Harwell. It is not intended that the 
lectures and experiments should necessarily provide material directly applicable to a school 
science syllabus. Lectures will be given by members of the Atomic Energy Research 
Establishment, Harwell. The visits to installations at Harwell are designed to illustrate 
some of the material in the lectures. 


Lectures include: Experimental Nuclear Physics; Physics and Engineering of Nuclear 
Reactors; Types and Uses of Nuclear Reactors; Health Physics; Instrumentation; Isotopes; 
and Thermonuclear Reactions. 


There will be short visits to: The National Institute for Research in Nuclear Science; 
Particle accelerators at A.E.R.E.; Chemistry and Remote Handling buildings; ZETA (Zero 
Energy Thermonuclear Assembly); BEPO, LIDO and DIDO reactors and irradiation 
ponds. 


The experiments will be on two evenings, including a selection from the following: 
Fundamentals of Geiger counting; determination of half life; absorption of beta and 
gamma radiation; ion chamber characteristics; neutron detectors; neutron and gamma 
radiation shielding; demonstration of reactor behaviour by means of reactor simulator. 


Science masters who do not live locally will be accommodated at Lincoln College, 
Oxford. 


Application forms and further details are available from: The Manager, Reactor School, 
A.E.R.E. Harwell, Didcot, Berks. 


BOOK REVIEW 


Techniques of High Energy Physics. Edited by David M. Ritson. (Interscience Publishers, 
1961.) [Pp. xii+540.] 126s. 

This is Volume 5 of an Interscience series of which R. E. Marshak is the Editor, assisted 
by a distinguished and expert panel of advisers. 

There are eleven chapters written by authorities, on general properties of particles and 
radiation by D. M. Ritson (the editor of the volume); diffusion cloud chambers by R. 
Schluter; bubble chambers by I. A. Pless; nuclear emulsions by Mildred Widgoff; digital 
computers by D. O. Caldwell and J. E. Flanagan; ionisation counters by R. Wilson; scintillation 
and Cerenkov counters by D. M. Ritson and R. Weinstein; transistors by V. L. Fitch; 
beam optics by D. Luckey; target preparation by G. S. Janes and beam monitoring by 
D. O. Caldwell and G. S. Janes. Six appendices give collections of relevant useful data 
and some newer information (e.g. on spark chambers). There are excellent lists of references, 
although the dates seem to show that some articles were waiting rather a long time 
for publication. 

The field has undergone a revolution in the last decade, and some traditional techniques 
have quite properly been omitted. The rapid development has inevitably caused some 
deficiencies, notably the problems of track following engines and other automatic or 
semi-automatic devices for data analysis which are mentioned but not adequately discussed. 
The range of topics has been well chosen, and it is particularly pleasing to see a chapter 
on digital computers (which is, however, given a wholly American slant) since these 
machines are becoming an essential part of high energy experiments. 

The book succeeds in its editor’s expressed aims of putting the various experimental 
techniques into perspective for professionals and helping beginning physicists, and it can 
be recommended to members of both groups. Ahaxer Pieler 


Electrons in Metals: A Short Guide to the Fermi Surface 


by J. M. ZIMAN 
University Lecturer in Physics, and Fellow of King’s College, Cambridge 


Part I. THE ELECTRON GAS 


‘Either be wholly slaves or wholly free.’—Dryden 


1. INTRODUCTION 


Browsing in books, review articles and conference reports, one sometimes 
nowadays comes across pictures like fig. 1. This peculiar object, which might be 
mistaken for a piece of modern sculpture, is labelled ‘The Fermi surface of 
magnesium’. What does it mean, to say that magnesium has a Fermi surface, 
and how is it determined? This is the first of a series of articles in which an 
attempt is made to answer these questions. 

Fig. 1. The Fermi surface of magnesium, or Leo Falicov’s Monster. 

The Fermi surface is a mathematical construction related to the dynamical 
properties of the conduction electrons in a metal. To understand its significance 
we must first accept the idea that these electrons form a gas obeying Fermi- 
Dirac statistics. This is dealt with in the present article. In the second article 
we consider the effect on this gas of the regular crystalline lattice of the ions of 
the metal amongst which the electrons move. We show that this creates gaps 
in the range of energies allowed to the electrons, and we learn how to construct 
the Brillouin zone, a polyhedron in momentum space whose boundaries define 
the positions of the energy gaps. 

The third article is devoted to a study of the dynamical properties of the 
electrons in the allowed energy bands, under the influence of electric and 
magnetic fields. We learn here how these properties can be derived from the 
shape of the Fermi surface, and we study the problem of calculating the shape 
of the Fermi surface from first principles. We shall find that the idea that 
the electrons are nearly free can be justified mathematically, in spite of the very 
strong interactions between electrons and ions, and between the conduction 
electrons themselves. In the fourth article we look at real metals, especially the 
alkali metals, and show how such properties as electrical conductivity and 
thermoelectric power are determined by the shape of the Fermi surface; this 
article also gives an account of the anomalous skin effect, which provided the 
first detailed picture of the complicated shape of the Fermi surface in copper. 

In the final article, we look at other methods for the experimental study of 
the Fermi surface. These methods all depend on the use of strong magnetic 
fields which cause the electrons to move in helical paths in the metal. These paths 
have their representation in momentum space as orbits on the Fermi surface, 
and give rise to characteristic phenomena such as cyclotron resonance, magnetoresistance, 
the de Haas-van Alphen effect, the magneto-acoustic effect, etc. 
By studying the variation of these effects as one varies the direction of the 
magnetic field, one can map out the Fermi surface, and make a model of it for 
each metal. 

These articles are only travellers’ tales, not a Baedeker. I have only tried to 
make the general theory plausible; there are many admirable books which deal 
with the subject in all degrees of rigour and which are listed at the end of this 
chapter. For this reason, no specific references to the original literature will 
be given in the main text; it may be assumed that these can all be discovered 
in the books which I have listed. 


2. MOBILE ELECTRONS 


Consider sodium. We all know that it is a monovalent element because it 
has just one electron outside a closed shell. This electron is very loosely bound, 
and may easily be lost, leaving a sodium ion. Almost all the ordinary chemistry 
of sodium refers to this ion, in water, in crystals such as NaCl, etc. 

Now put two neutral sodium atoms together. They will begin to interact 
with one another. Atom A may attract the valence electron from B and ionize it. 
Or, both valence electrons may prefer to go to B, leaving A temporarily bare. 
If the two nuclei (bearing their closed shells) come near enough together, it 
becomes possible for both the electrons to circle about both ions together, just 
as if we had a doubly-charged nucleus (e.g. He) with two electrons in orbit 
around it. In fact this is a rather favourable arrangement, because the two 
electrons form almost a closed atomic sub-shell; the Na₂ molecule is normal in 
the vapour. 

Now let us pack together more and more sodium atoms, as in a solid or a 
liquid. Each neighbouring pair of atoms would like to form a molecule and 
share the electrons. But each atom has eight neighbours and has only one 
electron to be shared with all of them. So this electron is sent on a very complex 
course, visiting each neighbour in turn. The other eight electrons from these 
neighbours also travel round the complex, each of them spending about one 
eighth of its time with the central atom of our little group. Then, since this is 
only one cell of a complete crystal lattice, other electrons will come in from 
further cells—and so on. In other words, instead of having a cosy arrangement 
in which each ion has its ‘own’ electron bound tightly to it, we have a sort of 
communist society in which all the ions possess all the electrons in common, 
and the electrons can move freely from one ion to another. 


Fig. 2. (a) Sodium atom; (b) Sodium molecule; (c) Sodium metal. 


This idea is quite enough to explain, in general terms, many characteristic 
properties of metals, most of which stem from the fact that metals are good 
conductors of electricity. Obviously, if the electrons are very mobile, we may 
apply an electric field to draw them out of one side of the crystal—and go on 
drawing them out so long as we feed them back again on the other side to keep 
the whole crystal electrically neutral. That is, we can easily pass an electric 
current through the solid. 

Contrast this behaviour with what happens in an insulator, such as NaCl. 
Here the valence electron of the sodium has been swallowed by the chlorine 
atom, giving an Na⁺ ion and a Cl⁻ ion, both with complete closed shells. To 
make a current flow, we must either make the chlorine ion regurgitate the 
electron (a process costing many electron volts in energy) or we must force 
a whole ion to move through the lattice (a very slow process, only possible in 
practice if the crystal structure is imperfect). 


3. JELLIUM 


It is typical of modern physicists that they will erect skyscrapers of theory 
upon the slender foundations of outrageously simplified models. We have an 
array of ions, each with charge enough to bind an electron if left to itself, and 
an equal number of electrons which seem to be able to move around a bit 
amongst the ions. So we take the bit between our teeth, and assume that the 
ions are not there at all! Or, rather we smear them out into a fixed uniform 
background of positive charge—a sort of jelly—in which the electrons move 
quite freely. We need this jelly to make our system electrically neutral, so that 
the electrons will not be driven explosively apart by their coulomb repulsion. 
It is sometimes instructive to think about the effects of elastic waves in this 
medium, but generally speaking it can be ignored. Thus the properties of 
jellium are those of a free electron gas. 

On the face of it this model seems wildly unrealistic. How can we simply 
ignore those strong local forces, varying so rapidly in the neighbourhood of each 
ion? In Part III we shall see that it is a much better representation of the metal 
than one would have thought at first. 


4. THE WIEDEMANN-FRANZ LAW 


We saw that high electrical conductivity is the basic ‘metallic’ property. 
Naturally we wish to calculate this for our free electron gas. That is easy; 
the conductivity would be infinite. But then if we think of jellium, we remember 
that the positive jelly will be subject to fluctuations of density due to thermal 
vibrations. A local condensation of the jelly (i.e. a local increase of average 
packing density of the ions) will look like a local positive charge, and this will 
scatter the conduction electrons. Again, there may be an impurity in the metal, 
an element of different valency whose ion cannot be smeared into the background. 
Or, again, there may be a vacant site, or an interstitial atom, or a 
dislocation line, or a grain boundary, where we can no longer suppose the 
medium to be homogeneous and smooth. All such objects will scatter the 
electrons in the gas.† 

† But the actual ions of the lattice do not scatter the electrons; they alter their dynamical 
properties. This will appear in Parts II and III. 

To allow for such effects let us introduce a relaxation time, τ, such that an 
electron is only free for τ seconds (on the average), before it is scattered. Now 
apply an electrical field, E. The force on the electron will be eE, and it will be 
accelerated. In τ seconds it will acquire a drift velocity 

$$u \sim eE\tau/m$$

in the direction of the field. After each collision with an impurity, etc., this 
velocity will be lost, or so changed in direction as to be effectively zero. If 
there are n electrons per unit volume, each carrying the charge e, this drift 
motion will be equivalent to an electric current density 

$$J \sim neu. \tag{1}$$

In other words the electrical conductivity of our electron gas can be written 

$$\sigma = J/E \sim ne^2\tau/m. \tag{2}$$


This very simple and important formula tells us what we might have guessed, 
that the conductivity of a metal is proportional to the number of free electrons 
per unit volume. But to compare this with experiment we need a theory for 
the relaxation time τ, and this is far above our present level of argument. 
Nevertheless there is one very instructive comparison that can easily be made. 


Think what happens if, instead of an electric field, we establish a temperature 
gradient, grad T, along our specimen. At temperature T, an electron has 
energy $\tfrac{3}{2}kT$, where k is the Boltzmann constant. An electron in a ‘hot’ region 
(temperature T + δT, say) will have excess energy $\tfrac{3}{2}k\,\delta T$, and will tend to diffuse 
into a ‘colder’ region. This excess energy changes as we go along the wire, 
just as if there were a field of potential acting along it. It is as if there were a 
force of magnitude $\tfrac{3}{2}k\,(\mathrm{grad}\,T)$ acting on each electron. This thermodynamic 
force gives rise to a drift of electrons down the wire, controlled, as in the electrical 
case, by collisions with imperfections, etc. The average drift velocity comes 
out by the same argument: 

$$u \sim \tfrac{3}{2}k\,(\mathrm{grad}\,T)\,\tau/m.$$

Each electron carries heat $\tfrac{3}{2}kT$ with it. The heat current will thus be 

$$U \sim \tfrac{3}{2}\,nkT\,u \tag{3}$$

as if there were a thermal conductivity of the electron gas: 

$$\kappa = U/(\mathrm{grad}\,T) \sim \left(\tfrac{3}{2}\right)^2 nk^2T\tau/m. \tag{4}$$


The beauty of these formulae is that we may eliminate the unknown relaxation 
time, and discover the following general relation: 

$$\frac{\kappa}{\sigma T} \sim \left(\tfrac{3}{2}\right)^2 \frac{k^2}{e^2}. \tag{5}$$
This tells us that the electrical and thermal conductivities of a free electron gas 
are not independent of one another: it has been known as an empirical law 
since it was discovered by Wiedemann and Franz in 1853. Our derivation is 
rather informal (as indicated by the use of twiddles instead of equality signs) 
but it turns out to be quite a sound result. The strict derivation, in the full 
panoply of statistical quantum mechanics, only introduces a numerical factor 
π²/3 instead of (3/2)², but tells us that the law is not true if the electron is 
scattered with change of energy. 
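
The two numerical factors are easily compared. The sketch below evaluates the ratio κ/σT with the classical factor (3/2)² and with the quantum factor π²/3; it works in SI units with modern values of the constants, which is an assumption of this example rather than anything in the original argument, and the function name is hypothetical.

```python
import math

K_B = 1.380649e-23          # Boltzmann constant, J/K
E_CHARGE = 1.602176634e-19  # magnitude of the electron charge, C


def lorenz_number(prefactor):
    """Return prefactor * (k/e)^2, the ratio kappa / (sigma T), in W ohm / K^2."""
    return prefactor * (K_B / E_CHARGE) ** 2


classical = lorenz_number((3 / 2) ** 2)   # informal derivation in the text
quantum = lorenz_number(math.pi**2 / 3)   # strict quantum-statistical result

print(f"classical: {classical:.3e} W ohm / K^2")
print(f"quantum:   {quantum:.3e} W ohm / K^2")
print(f"ratio quantum/classical: {quantum / classical:.3f}")
```

The two estimates differ by less than a factor of one and a half, which is why the informal argument can fairly be called a sound result.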

We now see that high thermal conductivity is also characteristic of metals, 
being associated with the high electrical conductivity of the electron gas. The 
experimental confirmation of this relation tells us that the same carriers are 
responsible for both phenomena. 


5. THE HALL EFFECT 


There is another transport effect which can be discussed in terms of this 
simple classical model. Suppose that we have a strip of metal with an electric 
current flowing uniformly along it. We apply a magnetic field at right angles 
to the strip. The electrons, being charges moving in a magnetic field, will be 
deflected to one side. We might think that this would impede the flow of 
current. But a charge will then build up along the edge of the strip. Further 
electrons approaching the edge will be deflected back. We soon get a 
stationary state, in which the deflection caused by the magnetic field is balanced 
by the electric field of the system of charges along the edges of the strip, and 
current flows as before. This electric field can be detected as a potential 
difference at right angles to the current direction and to the magnetic field, 
and is called the Hall field. 


Fig. 3. The Hall effect. Panels: current flows; charges deflected by magnetic field; space charge builds up; electric field deflects charge back; current stays parallel to axis. 


To calculate this effect is easy. Each electron moves with drift velocity u, 
so that, by the standard Lorentz formula of electrodynamics, the magnetic 
field exerts a force 


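$$\mathbf{F} = \frac{e}{c}\,\mathbf{u}\wedge\mathbf{H}. \tag{6}$$

This will be exactly balanced if the Hall field is of strength 

$$e\mathbf{E}_H = -\mathbf{F} = -\frac{e}{c}\,\mathbf{u}\wedge\mathbf{H}. \tag{7}$$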


We can express this in terms of the electric current density, J = neu, by multiplying 
through by the density of electrons in the specimen. Thus 

$$\mathbf{E}_H = +\frac{1}{nec}\,\mathbf{H}\wedge\mathbf{J}. \tag{8}$$


In the standard configuration where J and H are deliberately constrained to 
be at right angles, we can say that the Hall voltage is proportional to the strength 
of the magnetic field, and to the magnitude of the current, with the Hall 
coefficient 


$$R = \frac{1}{nec}. \tag{9}$$


This derivation of the formula is again rather crude, but the result is correct 
for a free-electron gas, classical or quantal. It is interesting because we can 
use it to discover the sign of the carriers, and their density. The Hall effect is 
linear in e, and should therefore be negative for electrons (it is a good exercise 
in mental ping-pong to visualize fig. 3 with the charges reversed in sign!). 
Most metals do have ‘normal’ Hall coefficients, but there are some quite 
ordinary metals (Pb, Zn) which behave as if the carriers were positively charged. 
Even in the normal case, we do not always find that the density of electrons is 
the same as the total number of valence electrons that can be put into the pool. 
These anomalies cannot be explained within the free-electron model; we shall 
see later how they arise through the interaction of the electrons with the ionic 
lattice. 
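
To see how the Hall coefficient yields the sign and the density of the carriers, the free-electron formula can be put into numbers. The sketch below uses the SI form R = 1/(nq), in which the factor c of the Gaussian formula does not appear and q is the signed charge of the carrier; the electron density for sodium is a rounded illustrative value assumed for this example, and the function names are hypothetical.

```python
E_CHARGE = 1.602176634e-19  # magnitude of the electron charge, C


def hall_coefficient(n, q):
    """Free-carrier Hall coefficient in SI units, R = 1/(n q), in m^3/C.

    n is the carrier density per cubic metre and q the signed carrier charge.
    """
    return 1.0 / (n * q)


def carrier_density(hall_coeff, q):
    """Invert R = 1/(n q) to recover the carrier density from a measured R."""
    return 1.0 / (hall_coeff * q)


N_NA = 2.65e28  # assumed conduction-electron density of sodium, per m^3

r_electrons = hall_coefficient(N_NA, -E_CHARGE)  # negative carriers
r_positive = hall_coefficient(N_NA, +E_CHARGE)   # the same density of positive carriers

print(f"R for electrons:         {r_electrons:.3e} m^3/C")
print(f"R for positive carriers: {r_positive:.3e} m^3/C")
print(f"density recovered from R: {carrier_density(r_electrons, -E_CHARGE):.3e} per m^3")
```

Reversing the sign of the carriers reverses the sign of R without changing its magnitude, which is the mental ping-pong exercise carried out arithmetically.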

Another anomaly may be noted. The Hall field exactly balances the magnetic 
field, so that the electrons are, in the end, subject to no transverse forces. Thus 
the conductivity of the metal measured along the current direction does not 
depend on the magnetic field. Experimentally, however, all metals show a 
transverse magnetoresistance (increase of electrical resistance of a wire placed 
transverse to a magnetic field) which would not occur in a free electron gas. 


6. SPECIFIC HEAT AND THERMOELECTRIC EFFECT 


The ionic lattice, with N ions per unit volume, will have a specific heat, at 
high temperatures, 3Nk. If the electrons are free inside the lattice, they will 
contribute a further $\tfrac{3}{2}nk$ to the specific heat, for we have assumed that they 
form a classical gas of particles without internal degrees of freedom. But this 
contribution is not specially noticeable in practice; metals obey Dulong and 
Petit’s Law (1819!) almost as well as insulators; the atomic heat tends to 
3Nk=3R at high temperatures. 

This is a famous objection to the free-electron theory of metals. Indeed 
it is an insuperable obstacle within classical physics. If the electrons are free 
enough to carry electric currents then they must contribute those degrees of 
freedom to the specific heat. 

A more subtle discrepancy having the same basic origin is in the thermoelectric 
power. According to classical theory, this should be much larger than 
is actually observed. We can deduce this quite simply from eqns. (1) and (3), 
where we expressed the electric current, and heat current, in terms of the drift 
velocity of the electrons. For any value of u, these have the ratio 

$$\Pi = U/J \sim \tfrac{3}{2}\,kT/e. \tag{10}$$


Suppose now (fig. 4) that we establish a closed circuit of two metals, A and B, 
and make a current J flow round it. Even if we keep the whole circuit at nearly 
constant temperature, there will be a heat current drawn round with the 
electrons, of amount ΠJ. But suppose, for some reason, that Π is not the same 
in metal A as it is in B. Then the heat current $U_A = \Pi_A J$ arriving at the A-B 
junction will not be the same as $\Pi_B J$ leaving it. Ignoring Joule heat (which is 
quadratic in J, and can therefore be made negligible by making J small) we must 
reject the heat $(\Pi_A - \Pi_B)J$ at this junction. At the other junction the same amount 
of heat will be absorbed. 

Fig. 4. The Peltier effect. Labels: electric current; heat current. 

This is the Peltier effect, and (10) gives us at once the Peltier coefficient for 
our classical gas. There is a familiar thermodynamic argument (the Kelvin 
relation) which links this with the Seebeck effect, whose measure is the absolute 
thermoelectric power 


$$Q = \Pi/T \sim \tfrac{3}{2}\,k/e. \tag{11}$$


Thus, in our free-electron model, all metals should have the same constant 
thermoelectric power, of magnitude about 100 μV/deg: the observed values 
are much smaller, up to, say, 10 μV/deg, and are proportional to the absolute 
temperature.‡ Moreover, in some metals Q is positive, instead of having the 
same negative sign as the charge on the electron. 
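
The figure of about 100 μV/deg follows directly from the classical estimate of the thermoelectric power as (3/2)k/e. A minimal sketch of the arithmetic, assuming modern SI values of the constants (the variable names are illustrative):

```python
K_B = 1.380649e-23           # Boltzmann constant, J/K
Q_ELECTRON = -1.602176634e-19  # signed charge of the electron, C

# Classical free-electron thermoelectric power, (3/2) k / e, in volts per degree.
thermopower = 1.5 * K_B / Q_ELECTRON
thermopower_uv = thermopower * 1e6  # converted to microvolts per degree

print(f"classical thermoelectric power: {thermopower_uv:.1f} microvolts per degree")
```

The result is negative, following the sign of the electronic charge, independent of temperature, and of the order of a hundred microvolts per degree, roughly ten times the largest values quoted above for real metals.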


7. THE EXCLUSION PRINCIPLE IN A GAS 


The puzzle of the specific heat of the electron gas can only be solved within 
quantum theory. It is a consequence of the Pauli exclusion principle. No two 
electrons may be in the same state. In building up atoms we put one electron 
into each state (counting each orbital state as being doubled by spin); when we 
have filled up all the states in a shell we must move up to the next. 

The question now is how to apply this principle to our free electron gas. 
How can we be sure that each electron is in a different ‘state’? 


Fig. 5. (a) Classical jellium; (b) Cellular jellium. 


One obvious way of doing this is to erect impenetrable walls about each 
electron, confining each to its own cell (fig. 5). But this localization costs energy. 
If the width of a cell is a, this is the maximum uncertainty in, say, the x 
coordinate of position of the electron. Correspondingly, the momentum of the 
particle in this direction will have a minimum uncertainty $\langle p_x\rangle$, such that 

$$a\,\langle p_x\rangle \sim \pi\hbar. \tag{12}$$


‡ Of course this discrepancy could not have been apparent historically, as one could 
only measure differences of thermoelectric powers; the observed e.m.f.’s of thermocouples 
could then be written off as small differences from the classical formula for different metals. 




Properly speaking, these uncertainties are root mean square deviations, so that 
we ought to write 


$$\langle p_x^2\rangle \sim \frac{\pi^2\hbar^2}{a^2}. \tag{13}$$


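There will be similar rules for the y and z components. Adding these we 
obtain a formula for the mean total kinetic energy of the electron 

$$\langle\mathcal{E}\rangle = \frac{1}{2m}\left[\langle p_x^2\rangle + \langle p_y^2\rangle + \langle p_z^2\rangle\right] \sim \frac{3\pi^2\hbar^2}{2ma^2}. \tag{14}$$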


(I have cooked the constants in the uncertainty principle to get the correct 
answer). 
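
To get a feeling for the size of the localization energy (14), one may put in numbers for a typical metal. In the sketch below the cell side a is taken as the cube root of the volume per conduction electron; the electron density used for sodium is a rounded illustrative value assumed for this example, the constants are modern SI values, and the function name is hypothetical.

```python
import math

HBAR = 1.054571817e-34      # reduced Planck constant, J s
E_MASS = 9.1093837e-31      # electron mass, kg
EV = 1.602176634e-19        # one electron volt in joules


def cell_energy_ev(a):
    """Kinetic energy 3 pi^2 hbar^2 / (2 m a^2) of an electron confined
    to a cubic cell of side a (metres), returned in electron volts."""
    return 3 * math.pi**2 * HBAR**2 / (2 * E_MASS * a**2) / EV


N_NA = 2.65e28               # assumed conduction-electron density of sodium, per m^3
a_na = N_NA ** (-1.0 / 3.0)  # side of a cube containing one electron, metres

energy_na = cell_energy_ev(a_na)
print(f"cell side: {a_na * 1e10:.2f} angstrom")
print(f"localization energy: {energy_na:.1f} eV")
```

With these assumed figures the cell side is a little over three ångströms and the energy is of the order of ten electron volts.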

This is a large energy—several electron volts. So we have a lot to gain, in 
our jellium model, by dropping the barriers and allowing the electrons to wander 
at will. Instead of keeping our states distinct in space, we allow them to overlap 
freely. In fact, we now go to the other extreme, and study electron wave 
functions covering the whole ‘box’ in which the gas is confined, the whole cube 
of metal. As in a typical atom, we must think about the ‘different’ wave 
functions which may be constructed in the same region of space, and then we 
put two electrons, one of each spin, into each such state. 

This problem, of ‘counting states’ in a box, is an old one, which goes right 
back into the theory of elastic and optical vibrations. For our purpose the 
simplest argument is from the de Broglie formula: the wavelength of an electron 
of momentum p is 


$$\lambda = \frac{2\pi\hbar}{p}. \tag{15}$$


Now suppose we have a cubical box of side La. We can get an exact number, 
s, of waves across the box if, say 


$$ s\lambda = La. \tag{16} $$


A wave of this wavelength, reflected from the side, and back again, would repeat 
itself exactly, and would thus be an acceptable stationary state of the system. 
If our wave is not travelling transversely across the box, there will be a similar 
rule governing the distance $\lambda_x$ between peaks measured normal to the walls 
(figs. 6 and 7) 

$$ s_x\lambda_x = La. \tag{17} $$

Fig. 6. Wavelength conditions for waves across a box. 
Fig. 7. Wavelength conditions for an arbitrary direction in a box. 




Thus, if we take this normal as the x direction, we have a rule for the x component 
of momentum, 

$$ p_x = \frac{2\pi\hbar\,s_x}{La}, \tag{18} $$

where $s_x$ is an integer. There will be similar rules for the y and z components, 
each of which must be an integral multiple of $2\pi\hbar/La$. The states are ‘quantized 
in momentum’ with the quantum numbers $s_x$, $s_y$, $s_z$, which are positive or negative 
integers. 

We want to count states, and fill them in order. We must express the energy 
of each in terms of its quantum numbers $s_x$, $s_y$, $s_z$. Clearly 

$$ \mathcal{E}(s_x, s_y, s_z) = \frac{p^2}{2m} = \frac{4\pi^2\hbar^2}{2mL^2a^2}\,(s_x^2 + s_y^2 + s_z^2). \tag{19} $$

In (12) the symbol a stood for the side of one little cubic cell containing one 
electron. If there are n electrons in the box of side La, then $n = L^3$. We could 
construct almost exactly n states by making each of the integers $s_x$, $s_y$, $s_z$ run 
from $-\tfrac12 L$ to $\tfrac12 L$, for this would give us almost exactly $L^3$ different combinations 
of quantum numbers. In that case, the highest states would have the energy 

$$ \mathcal{E}(\pm\tfrac12 L, \pm\tfrac12 L, \pm\tfrac12 L) = \frac{3\pi^2\hbar^2}{2ma^2}. \tag{20} $$

It is interesting that this is exactly the result that we derived in (14) for the 
energy of each electron confined to its own cell. This highest state is actually 
a solution to the problem with cell barriers, because these can be made its 
nodal surfaces. What we have done, in effect, is to replace ‘ localization in 
co-ordinate space’ by ‘ localization in momentum space’. Because the electron 
is now allowed to range over a distance La we may define its momentum within 
the uncertainty limits $\pm\pi\hbar/La$. Whereas before, we kept each state distinct 
by not allowing the wave functions to overlap in real space, now we can only 
distinguish the different states if they differ in momentum by twice the limits 
of uncertainty in momentum. Instead of quantizing our system by a set of 
labels telling us which cell in real space the electron is in (i.e. the lth room in 
the mth corridor on the nth floor), we label each state by quantum numbers 
which tell us which cell it occupies in ‘momentum space’. 
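
The counting argument can be tried out on a small box. The sketch below is an illustration only, written in Python; the box size L = 20 is an assumed value, and each quantum number is taken over exactly L integers (from -L/2 to L/2 - 1) so that the count comes out as L³ with no end corrections. Energies follow eq. (19) and are quoted in units of π²ħ²/2ma².

```python
from itertools import product

def cube_states(L):
    """All (s_x, s_y, s_z) with each integer running over L values, -L/2 ... L/2 - 1."""
    half = L // 2
    return list(product(range(-half, half), repeat=3))

def energy_in_units(state, L):
    """Eq. (19) expressed in units of pi^2 hbar^2 / (2 m a^2)."""
    sx, sy, sz = state
    return 4 * (sx * sx + sy * sy + sz * sz) / L**2

L = 20  # assumed (even) number of cells along one side of the box
states = cube_states(L)
print(len(states), L**3)                           # number of states and number of electrons
print(max(energy_in_units(s, L) for s in states))  # energy of the corner state
```

The corner state has three of these units of energy, that is 3π²ħ²/2ma², which is the value obtained in (14) for an electron shut in its own cell.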

This new quantization scheme does not change the energy scale. We 
cannot avoid a zero point energy of this order of magnitude by any means. 
But there is a net gain in freeing the electrons from their cells. The energy 
$3\pi^2\hbar^2/2ma^2$ of (14) is the energy of each electron in the cellular scheme; it is the highest 


† At this point let me say that I have cheated. If one is looking for stationary states in 
a box with perfectly reflecting walls these are standing waves like $\cos(xp_x/\hbar + \ldots)$, etc., 
which have nodes at every half wavelength. For these $p_x$ is quantized in units of $\pi\hbar/La$, 
so that we seem to have twice as many states as here. But then, of course, negative values 
of $p_x$, etc., only repeat the same range of functions, so must not be counted separately. It 
is much more convenient to split these into pairs of travelling waves, like $\exp(ixp_x/\hbar + \ldots)$ 
and $\exp(-ixp_x/\hbar - \ldots)$, which have the same energy but are distinct mathematical functions. 
Travelling waves do not satisfy the ‘reflecting’ boundary condition, but one can set up 
other boundary conditions (‘Born–von Kármán’, or ‘cyclic’ conditions) which are not so 
intuitively obvious but which lead to the above quantization rules. There is a theorem 
that says that in a large box we get the same distribution of energy levels whatever the 
shape and condition of the boundary. 




electron energy in the gas scheme. Thus, the average electron energy in the 
gas must be lower than this, in fact by $\pi^2\hbar^2/ma^2$. This is quite a large energy, 
of the order of one or two electron volts in ordinary metals. Very crudely, we 
have here a source of the cohesive energy of metals. There are many complica- 
tions in an exact calculation, but it is still approximately true to say that by 
bringing the atoms together and allowing the valence electrons to escape from 
their cells, we gain energy of about this amount. We can see that this cohesion 
depends, basically, on getting an electron gas of fairly high density, not on the 
details of the crystal structure. There is no need to construct special ‘ bonds’ 
between neighbouring ions; the electron gas behaves as a ubiquitous liquid glue 
that will bind together more or less any arrangement of ions that is sufficiently 
densely packed. 
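
The gain in average energy can be verified by averaging (19) over the cube of quantum numbers. This is a sketch under the same assumptions as the cubic filling scheme of the text; the box size L = 100 is an arbitrary illustrative value, and energies are in units of π²ħ²/2ma².

```python
def mean_and_max_energy(L):
    """Mean and highest energy over the cube -L/2 <= s < L/2, in units of pi^2 hbar^2 / (2 m a^2)."""
    half = L // 2
    values = range(-half, half)
    mean_square = sum(s * s for s in values) / L   # mean of s_x^2 along one axis
    mean_energy = 4 * 3 * mean_square / L**2       # three equivalent axes in eq. (19)
    max_energy = 4 * 3 * half**2 / L**2            # corner of the cube
    return mean_energy, max_energy

mean_energy, max_energy = mean_and_max_energy(100)  # assumed box size
print(mean_energy, max_energy, max_energy - mean_energy)
```

For a large box the mean tends to one unit and the maximum is three, so that the average electron lies about two units, or π²ħ²/ma², below the energy it would have in its own cell.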

This explains the tendency of metallic crystals to be ‘ close-packed ’, yet not 
strongly preferring one close-packed structure to another. We can understand 
the complex phase diagrams of alloy systems, where the balance of energy in 
favour of one structure may easily be tipped towards another phase by a small 
change of temperature or composition. The same general argument explains 
why metals are characteristically malleable and ductile. There are no specific 
bonds to be broken so that the atoms may be made to ‘ flow’ plastically, without 
spoiling the cohesion of the solid. 

Yet it must not be thought that quantization of momentum is necessarily 
the most favourable scheme for the valence electrons in all solids. In an ionic 
crystal, for example, the attraction of the spare electron from the metal atom to 
the halogen atom, and its localization there in a closed shell, easily outweighs 
the advantages of forming an electron gas. It is a nice point to decide the 
criteria favouring one or the other scheme, and to describe the transition between 
them that can be made to occur in some cases by compression or change of 




Fig. 8. The conductivity of oxides of vanadium, as a function of temperature (schematic). 




temperature. The best evidence is that this transition is sharp. The valence 
electrons must either be closely localized, each to a region of the size of a unit 
cell of the crystal, or else they must be quite delocalized, in states that can only 
be specified by momentum variables. These two schemes are quite distinct, 
and there do not seem to be any intermediate cases. 


8. THE FERMI ENERGY 


The estimate (20) of the energy reached when we have put all the electrons 
into different momentum states is actually too high. In the first place, the 
distribution of quantum numbers $-\tfrac12 L < s_x, s_y, s_z < \tfrac12 L$ is not the optimum. 
We can see this by plotting out a point for each value of $s_x$, $s_y$, $s_z$ (fig. 9), in a 




Fig. 9. The distribution of quantum numbers of allowed states in ‘momentum’ space. 


three-dimensional space. These make an array of points, equally spaced, and 
the states in the range we have counted are those inside the cube (square) out- 
lined in fig. 9. But from (19) we see that the energy of a state, in such a figure, 
is just the square of the distance of its representative point from the origin. 
But then points at the corners of the cube will have higher energy than those 
on the edges or faces. It is obviously favourable, energetically, to replace these 
higher states by others still empty and of lower energy. In fact, the best arrange- 
ment will be one where we only fill those states lying inside a sphere which 
contains the same number of points as our original cube. 

Then again, each electron has two spin states and these are essentially 
distinct from the point of view of the Pauli principle. Thus, if we have 
n electrons per unit volume, we only need to use $\tfrac12 n$ distinct wave functions. 
So our sphere need contain only half the volume of our original cube. Ignoring 




slight errors due to the discreteness of our distribution of points, we can say that 
this sphere must have a radius D such that 

$$ \tfrac43\pi D^3 = \tfrac12 L^3. \tag{21} $$

A point near the surface of the sphere would then have the energy 

$$ \frac{4\pi^2\hbar^2}{2ma^2}\left(\frac{D^2}{L^2}\right) = \left(\frac{3}{\pi}\right)^{2/3}\left(\frac{\pi^2\hbar^2}{2ma^2}\right). \tag{22} $$

This is the true energy of the highest occupied states in our distribution of 
electrons. 

Let us write this energy in another form: 

$$ \mathcal{E}_F = \frac{\pi^2\hbar^2}{2m}\left(\frac{3n}{\pi}\right)^{2/3}, \tag{23} $$


where we have replaced $a^3$, the volume of a cubical cell that would contain one 
electron, by 1/n. This shows us explicitly that the Fermi energy does not really 
depend on the size of the big cubical box or the size of the little cubical cells 
into which we divided the box. It is a function only of the density, n, of the 
electron gas, and not of such accidental features as the shape and size of the 
container. All the scaffolding of boxes, cells, stationary states and boundary 
conditions can now be removed, revealing this simple fact: if we apply the 
Pauli principle to a free-electron gas of density n, then the electrons must be 
supposed to fill states of kinetic energies up to $\mathcal{E}_F$, an energy which is only a 
bit smaller than the energy each electron would have if it were confined to a 
cell of volume 1/n. 
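
As a numerical illustration of (23), the sketch below evaluates the Fermi energy for an assumed electron density of 2.5 × 10²⁸ m⁻³ (roughly that of sodium; this figure is not taken from the article) and compares it with the cell energy (14) for a cell of volume 1/n. The function names are illustrative.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg
EV = 1.602176634e-19     # joules per electron volt

def fermi_energy_ev(n):
    """Eq. (23): (pi^2 hbar^2 / 2m) (3n/pi)^(2/3), with n in m^-3 and the result in eV."""
    return math.pi**2 * HBAR**2 / (2 * M_E) * (3 * n / math.pi) ** (2 / 3) / EV

def cell_energy_ev(n):
    """Eq. (14) for a cubical cell of volume 1/n, i.e. side a = n^(-1/3), in eV."""
    a = n ** (-1 / 3)
    return 3 * math.pi**2 * HBAR**2 / (2 * M_E * a**2) / EV

n = 2.5e28  # assumed electron density, m^-3
print(fermi_energy_ev(n), cell_energy_ev(n), fermi_energy_ev(n) / cell_energy_ev(n))
```

The ratio of the two energies is the pure number (3/π)^(2/3)/3, whatever density is assumed, which is another way of seeing that only n matters.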


9. FERMI–DIRAC DISTRIBUTION 


The Fermi energy is kinetic energy, of the order of several electron-volts, 
say 5 eV. This is much more than the ordinary energy of thermal excitation: 
kT = 0.03 electron-volts at room temperature. In terms of electron velocity, 
the electron gas looks as if it is extremely hot, with a temperature 

$$ T_F = \mathcal{E}_F/k \approx 50\,000^\circ. \tag{24} $$


The free electrons in a metal must be travelling with velocities about 1000 times 
the velocity of the ions, for their mass is smaller by a factor $10^4$, and their energy 
larger by a factor $10^2$. This is important for many theoretical investigations: 
we may usually treat the ions as if they were at rest, frozen into some typical 
configuration in the course of their thermal vibrations, whilst the electrons 
swarm about them. It is the basis of what is called the adiabatic principle in 
such calculations. 
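
These orders of magnitude follow from simple arithmetic, sketched below. The Fermi energy of 5 eV and the factors of 10⁴ in mass and 10² in energy are the round figures quoted in the text; the room temperature of 300 K is an assumption, and the function names are illustrative.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant, eV per kelvin

def fermi_temperature(fermi_energy_ev):
    """Temperature T_F = E_F / k of eq. (24), in kelvin."""
    return fermi_energy_ev / K_B_EV

def velocity_ratio(energy_ratio, mass_ratio):
    """Electron speed over ion speed, from v = sqrt(2E/m) applied to each particle."""
    return math.sqrt(energy_ratio * mass_ratio)

print(fermi_temperature(5.0))      # Fermi temperature for E_F = 5 eV
print(K_B_EV * 300.0)              # kT in eV at an assumed 300 K
print(velocity_ratio(1e2, 1e4))    # electron speed relative to ion speed
```

Since the speed goes as the square root of energy over mass, a factor 10² in energy and 10⁴ in mass combine to give the factor of about 1000 in velocity.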

But although the electron system always has a high kinetic energy, it is not 
necessarily ‘hot’. For example, it will still retain the high Fermi energy when 
it is at absolute zero, with the whole system in its ground state of energy. This 
is obviously just the state which we have assumed in calculating the Fermi 
energy: all the levels below this energy are filled and all the higher levels are 
empty. The system is obviously not ‘ hot’, since it cannot give up any of its 
energy to a colder body. 




Fig. 10. Fermi–Dirac distributions (a) at T = 0; (b) at a finite temperature. 


Now if we heat this system, what can we do to the distribution of electrons? 
They can exchange energy with the lattice, in parcels of about kT at a time. 
This is easy enough for the electrons at the top of the distribution—they can 
move up into the empty levels above $\mathcal{E}_F$. For those deeper down, however, this 
is impossible. The levels above them are already full, and, by the exclusion 
principle, cannot take any more electrons. So these electrons are inhibited 
from exchanging energy with their surroundings, unless one can find a means of 
carrying them into an empty state above the Fermi level. This would require 
much more energy than one can get from ordinary thermal fluctuations. It 
would be, perhaps, an optical transition, of several electron volts. 

Thus, although we have a very large number of electrons in our system, 
with kinetic energies ranging from zero to $\mathcal{E}_F$, when we heat it up we only affect 
those at the very top of the distribution. The same goes for any low energy 
excitation, such as application of an electric field, a magnetic field, or an acoustic 
vibration. The only electrons which can be affected are those in a narrow layer 
near the Fermi level. The other electrons can be treated as if they were quite 
inert, and no more capable of being excited than if they were in bound states 
deep down in the closed shells of the metallic ions. The information that we 
can obtain about the states of the electrons in a metal is thus limited. From 
the transport properties and some of the magnetic properties we can learn about 
electrons near the Fermi level; to study other states we must use optical excita- 
tion, or X-rays, whose effects tend to be brutal and difficult to interpret. 

The formal theory of the statistical mechanics of a system such as this, of 
particles obeying the Pauli principle, is due to Fermi and Dirac. The rule is 
that the average occupation number of a state of energy $\mathcal{E}$ is given by 

$$ f(\mathcal{E}) = \frac{1}{\exp\{(\mathcal{E}-\mathcal{E}_F)/kT\} + 1}. \tag{25} $$

One can easily see that this number is almost exactly 1 if $\mathcal{E} < \mathcal{E}_F$, and that it is 
exponentially small (like the high-energy tail of the classical Boltzmann distribution) 
when $\mathcal{E} > \mathcal{E}_F$. The only region where $f(\mathcal{E})$ is appreciably different from 
1 or 0 is near $\mathcal{E}_F$, over a width which depends on temperature, being a small 
multiple of kT. We say that we have a degenerate Fermi gas, because $\mathcal{E}_F \gg kT$. 
The names ‘Fermi level’, ‘Fermi energy’, and ‘Fermi surface’ come from 
this formula; Enrico Fermi himself did not make any other direct contributions 
to the theory of metals. 






Fig. 11. The Fermi—Dirac function. 
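
The behaviour of the Fermi–Dirac function (25) near the Fermi level can be tabulated with a few lines of Python. This is a sketch only: the Fermi energy of 5 eV is the round figure used in the text, the temperature of 300 K is an assumption, and the function requires a temperature greater than zero.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant, eV per kelvin

def fermi_dirac(energy_ev, fermi_energy_ev, temperature):
    """Average occupation number of eq. (25); temperature in kelvin, must be > 0."""
    x = (energy_ev - fermi_energy_ev) / (K_B_EV * temperature)
    if x > 700.0:          # exp(x) would overflow; the occupation is zero to machine accuracy
        return 0.0
    return 1.0 / (math.exp(x) + 1.0)

E_F = 5.0   # eV, round figure from the text
T = 300.0   # K, assumed room temperature
for offset in (-0.5, -0.1, -0.025, 0.0, 0.025, 0.1, 0.5):
    print(offset, fermi_dirac(E_F + offset, E_F, T))
```

The occupation changes from nearly 1 to nearly 0 within a few kT on either side of the Fermi level, and is exactly one half at the Fermi level itself.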


10. ELECTRONIC SPECIFIC HEAT 


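We can now solve the puzzle of the specific heat of the electrons. When we 
raise the temperature to T, we do not increase the energy of all the electrons 
by the amount kT. Only a fraction of them, of the order of $kT/\mathcal{E}_F$, are affected. 
Thus, the extra heat energy per unit volume is 

$$ \Delta\langle W\rangle \approx n\,kT\left(\frac{kT}{\mathcal{E}_F}\right). \tag{26} $$

This is equivalent to a specific heat of the form 

$$ C_{el} = \frac{\partial(\Delta\langle W\rangle)}{\partial T} \approx \frac{2nk^2T}{\mathcal{E}_F} = \gamma T. \tag{27} $$
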
The electronic contribution to the specific heat is thus (a) not constant but 
linear in T and (b) smaller by a factor like $kT/\mathcal{E}_F$ (e.g. 1/100) than the classical 
specific heat $(\tfrac32)nk$. It is not surprising that it cannot be noticed at ordinary 
temperatures. One must go to very low temperatures, where, according to 
the Debye $T^3$ law, the lattice specific heat becomes very small, before the linear 
term can be detected. 
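
The size of the reduction can be estimated by dividing (27) by the classical value (3/2)nk, which gives (4/3)kT/E_F independently of the density. The sketch below assumes a Fermi energy of 5 eV, the round figure used earlier, and a few illustrative temperatures.

```python
K_B_EV = 8.617333262e-5  # Boltzmann constant, eV per kelvin

def electronic_to_classical_ratio(temperature, fermi_energy_ev):
    """(2 n k^2 T / E_F) divided by (3/2) n k, i.e. (4/3) kT / E_F."""
    return (4.0 / 3.0) * K_B_EV * temperature / fermi_energy_ev

for temperature in (4.0, 77.0, 300.0):   # assumed temperatures, in kelvin
    print(temperature, electronic_to_classical_ratio(temperature, 5.0))
```

At room temperature the ratio is somewhat under one per cent, and it falls in proportion to T, so that the electronic term is small at all ordinary temperatures.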

The smallness of the specific heat of the electrons also explains the small 
thermoelectric power of metals. In fact, if we introduce a factor $2kT/\mathcal{E}_F$ into 
(11), we find 

$$ Q \approx \frac{k^2T}{e\,\mathcal{E}_F}, \tag{28} $$


which is a fair approximation, in order of magnitude and in dependence on tem- 
perature, to the observed thermoelectric power of a simple metal such as sodium. 
The electrons carry a much smaller net current of heat than we calculated in 
§ 6, because their specific heat is so low. 




REFERENCES 


‡Brillouin, L., 1946, Wave Propagation in Periodic Structures (New York: McGraw Hill). 
†Harrison, W. A., and Webb, M. B. (eds.), 1960, The Fermi Surface (New York: Wiley). 
Jones, H., 1960, Theory of Brillouin Zones (Amsterdam: North-Holland). 
‡Kittel, C., 1956, An Introduction to Solid State Physics, 2nd Edition (New York: Wiley). 
*Mott, N. F., and Jones, H., 1936, Theory of the Properties of Metals and Alloys (Oxford: 
Clarendon Press). 
Peierls, R., 1955, Quantum Theory of Solids (Oxford: Clarendon Press). 
†Pippard, A. B., 1960, Experimental Analysis of the Electronic Structure of Metals, Reports 
on Progress in Physics XXIII, p. 176 (London: Physical Society). 
Seitz, F., 1940, Modern Theory of Solids (New York: McGraw Hill). 
‡Slater, J. C., 1951, Quantum Theory of Matter (New York: McGraw Hill). 
Slater, J. C., 1956, The Electronic Structure of Solids, Handbuch der Physik XIX (Berlin: 
Springer). 
Wannier, G. H., 1959, Elements of Solid State Theory (Cambridge: Cambridge University 
Press). 
Wilson, A. H., 1953, Theory of Metals, 2nd Edition (Cambridge: Cambridge University 
Press). 
Ziman, J. M., 1960, Electrons and Phonons (Oxford: Clarendon Press). 


† Basic references for the new techniques for studying the Fermi surface. 
‡ Relatively elementary. 


Some Physical Aspects of Lubrication and Wear 


by A. W. CROOK 


Research Laboratory, Associated Electrical Industries, Aldermaston Court, 
Aldermaston, Berkshire 


SUMMARY 


The application of physical principles in some studies of lubrication and 
wear is described. The states of boundary and hydrodynamic lubrication 
are contrasted and the importance of identifying the state of lubrication of 
a system is emphasized. Experiments which demonstrate that the lubrication 
of gears and roller bearings is essentially hydrodynamic are described and the 
consequences of solid-solid contacts through the hydrodynamic film are 
discussed. The function of boundary additives such as are present in E.P. 
oils in mitigating the damage consequent upon solid-solid encounters is 
described and so is the influence upon damage of the surfaces themselves. 
Lastly the importance is indicated of wear experiments conducted without 
lubrication. Such experiments have a relevance to solid-solid encounters 
through hydrodynamic films and are of direct application to systems in 
which fluid lubrication cannot be tolerated. 


1. INTRODUCTION 


So often a little oil suffices to still a squeal or to restore a neglected machine 
to use that sometimes, without conscious effort, the scientific interest of lubrica- 
tion and wear and their economic importance pass unmarked. 

In industry, particularly to the designer, lubrication and an acceptable rate 
of wear can be critical. For instance the possibility of reducing cost by replacing 
a machined component by a die casting, by plastic or by a sintered compact of 
metal powder, might turn upon the wear and carrying capacity of bearings 
formed in the material. Nuclear power reactors have brought problems in 
conditions hitherto unknown. In the gas-cooled reactor fluid lubricants cannot 
be used within the pressure vessels, yet the bearings and gears of control 
mechanisms and charging machines must work reliably in circumstances which 
offer few opportunities for maintenance. In high speed flight mechanisms 
must work at temperatures higher than have had to be accepted before. In 
large gear boxes such as transmit power from a ship’s turbine to her propeller, 
ability to lubricate the teeth so they are undamaged is the present limitation 
upon ratings. In addition to such problems there is the ever-present burden 
of depreciation due to wear. 

Partial solutions technically acceptable have been found for many of these 
problems prior to an understanding of the underlying science. That is 
fortunate, for the science is incomplete. Indeed it seems remote that principles 
so definite will emerge that trial and error can be eliminated. But the search 
for principles is worth while; worth while in itself and to provide as rational a 
background as can be for the technical solution of the many particular problems 
continually arising. 






1.1 The inter-relation of lubrication and wear 


It is axiomatic that when lubrication is complete there can be no wear. 
But seldom is lubrication complete. There are breakdowns of the oil film 
occasioned by the severe conditions when surface roughnesses meet. Wear can 
be reduced by choosing surfaces tolerant of solid-solid contact. Case-hardening 
is a traditional measure, whilst today there are available, for example, chemical 
pre-treatments such as phosphating and oils carrying chemical agents (E.P. oils) 
which, under the agency of the temperatures arising where solid-solid contact 
occurs, locally form films which protect the surfaces at subsequent encounters. 
Another way of reducing wear is to increase the thickness of the oil film so that 
solid-solid contact becomes less frequent. 


1.2 Types of lubrication 


Lubrication can be classified as boundary or hydrodynamic. In boundary 
lubrication it is postulated that the surfaces are protected by films of molecular 
thickness physically or chemically adsorbed upon them. For example, if an 


Fig. 1. Journal and bearing. The upper curve shows the oil pressure (P) 


organic acid (polar molecules) be dissolved in a paraffinic base (non-polar 
molecules) then, provided that the temperature is not too high, a monomolecular 
layer of polar molecules will be adsorbed by the surfaces (Bowden and Tabor 
1950). The chemical agents of E.P. oils provide a companion example of chemisorption 
but in both cases, with respect to boundary lubrication, the bulk of the 
lubricant is irrelevant except as the carrier for the active molecules. But in 
hydrodynamic lubrication the viscosity of the bulk is of great importance. This 




form of lubrication will be described by reference to the plain journal bearings 
with which it was elucidated in the 1880’s by Beauchamp Tower and Osborne 
Reynolds (see Barwell 1956). The journal in fact runs eccentrically in its 
bearing (fig. 1). Because of viscosity the oil is dragged by the journal through 
the bearing. On the entry side the oil passes through a converging gap and, as 
Reynolds showed from the differential equation of hydrodynamic flow (which 
in this connection may be written as 

$$ \frac{\partial P}{\partial x} = \eta\,\frac{\partial^2 u}{\partial y^2} \tag{1.1} $$

where P is pressure, x is measured peripherally, $\eta$ is viscosity, u the velocity of 
the oil measured against the peripheral direction and where y is measured 
radially), the oil thereby generates a pressure as shown in the figure. It is this 
pressure integrated over the bearing which supports the load. 

In contrast with boundary lubrication in hydrodynamic lubrication the 
thickness of the oil film depends upon viscosity, speed and load and it is therefore 
possible by design to vary the thickness. With journal bearings the thickness 
is typically 1 x 10-* cm; vast compared with the molecular films of boundary 
lubrication. 

If in all instances such hydrodynamic films were obtainable then wear would 
be a much less pressing problem. But that is not so. Even when a hydro- 
dynamic film exists it does so only when the machine has been brought to speed 
and other protection is required on stopping and starting. Furthermore, in 
some instances it is by no means obvious which type of lubrication obtains. 
But it is necessary to know this before rational solutions of the problems of 
lubrication and wear can be hoped for. 


2. LUBRICATION WHERE STRESSES ARE HIGH 


In gears and roller bearings the loads are borne nominally along lines or at 
points (ball bearings). The pressures are extreme and may be estimated in 
the first instance by reference to the Hertz theory of the contact of elastic solids 
(see standard works on elasticity). From the theory the deformations and pres- 
sures occurring where two elastic bodies (e.g. a ball against a flat or two cylinders 
with parallel or crossed axes) are loaded together may be calculated. Under 
typical conditions of load the theory gives pressures of 50 ton/in² or more for 
gears and roller bearings. The pressures in fact approach the limit the materials 
can withstand elastically, i.e. they approach their hardness. These pressures 
are much greater than those encountered with journal bearings in which the 
pressures are of the order 100 lb/in². But in another respect gears and roller 
bearings are similar to a journal in that they also possess at their conjunctions 
the feature of a convergent gap. For many years there has been uncertainty 
about their lubrication. The convergent gap has suggested that hydrodynamic 
lubrication might exist whilst the extreme pressures have persuaded many that 
boundary lubrication alone is possible. 
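
The kind of estimate the Hertz theory provides may be sketched for the simplest case of two parallel cylinders. The formulae used below are the standard textbook results for line contact and are not given in the article; the elastic constants (a modulus of 30 × 10⁶ lb/in² and a Poisson ratio of 0.3, typical of steel), the radii of 1.5 in and the load of 600 lb per inch of face width are assumed values chosen only for illustration.

```python
import math

def hertz_line_contact(load_per_length, radius_1, radius_2,
                       modulus_1, poisson_1, modulus_2, poisson_2):
    """Half-width and peak pressure for two parallel elastic cylinders (consistent units)."""
    r_eff = 1.0 / (1.0 / radius_1 + 1.0 / radius_2)
    e_eff = 1.0 / ((1.0 - poisson_1**2) / modulus_1 + (1.0 - poisson_2**2) / modulus_2)
    half_width = math.sqrt(4.0 * load_per_length * r_eff / (math.pi * e_eff))
    peak_pressure = 2.0 * load_per_length / (math.pi * half_width)
    return half_width, peak_pressure

def pressure_at(x, half_width, peak_pressure):
    """Semi-elliptical Hertz pressure at distance x from the centre of the contact."""
    if abs(x) >= half_width:
        return 0.0
    return peak_pressure * math.sqrt(1.0 - (x / half_width) ** 2)

# Assumed values: two steel discs, units of lb and in.
b, p0 = hertz_line_contact(600.0, 1.5, 1.5, 30e6, 0.3, 30e6, 0.3)
print(b, p0, p0 / 2240.0)   # half-width (in), peak pressure (lb/in^2 and long ton/in^2)
print(pressure_at(0.5 * b, b, p0))
```

The peak pressure rises only as the square root of the load, while the width of the contact band grows in the same proportion, so that heavier loads are carried chiefly by a wider band.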

In a series of experiments it has been demonstrated that the lubrication is 
in fact hydrodynamic. The latest experiment will be described briefly as it 
brings out directly the salient features of the conjunctions. 




2.1 The cross-section of the oil film 


The essential features of the apparatus are illustrated in fig. 2a. A three 
inch diameter disc of hardened steel was rolled under load against a three inch 
glass disc. Both were accurately formed and polished. The glass disc carried 


Fig. 2. The measurement of film shape: (a) arrangement of discs, (b) plan of electrode 
upon glass disc, (c) electrical connections to oscilloscope. 


the evaporated chromium electrode shown in plan in fig. 2b and electrical 
connections were made as in fig. 2c (in the figure the thickness of the electrode 


Fig. 3. Oscilloscope trace of the potential across the resistor R of fig. 2c. 
(The passage of the leading edge of the electrode through the conjunction is from entry 
to recess.) 




is grossly exaggerated). As the leading edge of the electrode approaches and 
passes through the conjunction of the surfaces the electrical capacitance between 
the electrode and the steel disc rises and it may be shown that, with a sufficient 
accuracy, the potential appearing across the resistor is proportional to $\epsilon/h$ where 
h is the gap separating the discs at the leading edge and $\epsilon$ is the dielectric constant 
of the intervening medium. Results have been obtained up to loads of 600 lb/in 


Fig. 4. The cross section of the oil film in absolute measure (film thickness h against 
distance x through the conjunction, from entry to recess; the points A and B are marked 
on the curve). 


of face width and peripheral speeds of 30 ft/sec. Such a load and such a speed 
are comparable with those met in turbine reduction gears. A typical oscilloscope 
trace of the potential across the resistor is shown in fig. 3. From such traces 
the shape of the oil film in cross-section may be obtained in absolute measure. 
A typical shape is given in fig. 4. 




Fig. 5. The Hertz pressure distribution at the contact of elastic bodies. 


For its particular conditions the figure shows the film thickness to be 
approximately 1 µ (1 µ = 1 × 10⁻⁴ cm), i.e. large compared with molecular dimensions. 
The lubrication is hydrodynamic. Furthermore, between the points 
A and B the film thickness is approximately constant (space prohibits discussion 




of the small constriction at the recess end) and this allows some simplifications 
to be made. Clearly the discs must be deformed by the hydrodynamic pressures 
much as they would be were they loaded together statically because, although 
separated by a film, its constant thickness implies that the shapes of the discs 
must be complementary as they would be in dry contact. Hence the 
pressures developed hydrodynamically must be approximately those of the 
Hertz theory, i.e. the pressure distribution must be approximately semi-elliptical 

Fig. 6. Film thickness as a function of $\bar{u}\eta_s$ (in dyn cm⁻¹): (a) for rolling discs, 
(b) for discs with peripheral speeds in the ratio 1 : 1.5. 
($h^*$ is film thickness, $\bar{u}$ is the mean peripheral speed of the discs and $\eta_s$ is the viscosity of 
the oil at the surface temperature of the discs.) 




(fig. 5). In addition, as the surfaces are parallel, the conjunction of the discs 
can be regarded as a parallel plate condenser and as a parallel plate viscometer. 
By so doing and after allowance has been made for contributions arising outside 
of the parallel section, film thicknesses may be deduced from measurements of 
the electrical capacitance between two steel discs (or of other metal), and with 
this knowledge of thickness, the effective viscosity of the oil within the pressure 
zone may be deduced from observations of the relationship between frictional 
traction and sliding speed. 
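
The two idealizations can be written down directly. The sketch below treats the parallel section as a plate condenser, h = εε₀A/C, and as a plate viscometer, η = Fh/(A Δu), in SI units; it ignores the allowance for contributions from outside the parallel section that the text mentions. The contact area, dielectric constant, capacitance and traction are hypothetical values chosen only to give figures of a plausible order, and the function names are illustrative.

```python
EPS0 = 8.8541878128e-12  # permittivity of free space, F/m

def film_thickness_from_capacitance(capacitance, area, dielectric_constant):
    """Parallel plate condenser: h = eps * eps0 * A / C (SI units, result in metres)."""
    return dielectric_constant * EPS0 * area / capacitance

def effective_viscosity(traction, thickness, area, sliding_speed):
    """Parallel plate viscometer: eta = F h / (A du) (SI units, result in Pa s)."""
    return traction * thickness / (area * sliding_speed)

# Hypothetical values for illustration only.
area = 1.75e-5            # m^2, assumed area of the parallel section
dielectric_constant = 2.3 # assumed value for a mineral oil
capacitance = 3.564e-10   # F, assumed measured capacitance
h = film_thickness_from_capacitance(capacitance, area, dielectric_constant)
eta = effective_viscosity(98.0, h, area, 4.0)   # 98 N traction at 4 m/s sliding (assumed)
print(h, eta, eta * 10.0)  # thickness (m), viscosity (Pa s) and viscosity in poise
```

The second step depends on the first: an error in the thickness deduced from the capacitance passes directly into the effective viscosity.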


2.2 Film thickness 


Film thicknesses have been deduced over wide ranges of load, rolling speed 
(the mean peripheral speed of the two discs) and sliding speed (the difference 
of the peripheral speeds) from experiments with two discs of steel. The 
thicknesses were deduced from measurements of the electrical capacitance 
between the discs (a.c. bridge method) and also from other capacitance measure- 
ments which did not rest for their interpretation upon the postulate of a parallel 
film (Crook 1958). It was found, at loads greater than a certain value and within 
the range of practical interest, that the film thickness was independent of load, 
dependent upon rolling speed ($\bar{u}$) and upon the viscosity of the oil at the surface 
temperature of the discs ($\eta_s$). Because of the spread of frictional heat into the 
discs their temperature differed from the temperature of the oil from the supply 
jet. The significant viscosity was found to be $\eta_s$, not the viscosity of the oil 
as supplied by the jet. 


Fig. 7. Measurements of film thickness ($h^*$ in ångströms) with crossed cylinders (inset) 
by Archard and Kirk (1961) ($\alpha$ is a quantity relating to the increase in viscosity with 
pressure). 


Some results are shown in the log-log plots of fig. 6. The film thickness h* 
is plotted against $\bar{u}\eta_s$. Results from experiments in which the discs rolled are 
given in fig. 6a whilst fig. 6b is from experiments in which one disc was rotated 




at 1.5 times the speed of the other. The figure shows that the film thickness 
is independent of load and also, as a comparison of figs. 6a and 6b will demonstrate, 
that the film thickness, when measured at constant $\eta_s$, is largely independent 
of sliding. The latter feature is perhaps surprising since the introduction 
of sliding implies a greater generation of frictional heat. 

Similar results have also been obtained by Archard and Kirk (1961) in 
experiments with crossed cylinders (fig. 7) which nominally make contact at a 
point. Upon reducing speed they were able to demonstrate the persistence of 
hydrodynamic lubrication down to a film as fine as 200 Å. Before these measurements 
were made it would have been considered that only a state of boundary 
lubrication could exist at such low speeds. 


2.3 Frictional traction and effective viscosity 


Some measurements of frictional traction as a function of sliding speed are 
given in fig. 8. The load was constant and $\bar{u}$ was held constant as the sliding 
speed was varied. As sliding is introduced the friction rises, reaching a maxi- 
mum, and then falls. Conveniently this behaviour may be considered in terms 


Fig. 8. Frictional traction (kg) as a function of sliding speed (cm s⁻¹) (load 300 kg, $\bar{u}$ 800 cm s⁻¹) 


of a parallel plate viscometer. When the moving plate moves slowly the drag 
will be proportional to speed but as the speed increases the heating of the lubricant 
becomes appreciable and its viscosity will fall. Evidently in the disc experiment 
the reduction in viscosity ultimately more than offsets the effect of an increase 
in sliding speed. 

From the friction curve and a knowledge of film thickness the effective 
viscosity within the pressure zone may be deduced. By effective viscosity is 
meant that constant viscosity throughout the pressure zone which would give 
rise to the observed frictional traction. A typical graph of effective viscosity 
as a function of sliding speed is given in fig. 9. The interrupted line gives 
film thickness. 

The curve shows the effective viscosity at the rolling point to be 600 poise 
This is 1500 times greater than ys which was 0-4 poise. Upon the introduction 


Some Physical Aspects of Lubrication and Wear 265 


of 400 cm s⁻¹ sliding the viscosity fell fortyfold to 14 poise, yet the film thickness
fell by only 10%.
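
The parallel-plate picture used above can be put into a few lines of arithmetic. The sketch below is illustrative only: it assumes the simple relation traction = viscosity × area × sliding speed / film thickness, works in c.g.s. units, and the function name together with the traction, contact area and film thickness passed to it are hypothetical values chosen for the example, not data from the experiments. Only the three viscosities (0.4, 600 and 14 poise) are taken from the text.

```python
KGF_TO_DYN = 980665.0  # one kilogram-force expressed in dynes


def effective_viscosity(traction_kgf, film_thickness_cm,
                        contact_area_cm2, sliding_speed_cm_s):
    """Constant viscosity (poise) that would give the observed traction
    in a parallel-plate viscometer of the stated area and gap."""
    shear_stress = traction_kgf * KGF_TO_DYN / contact_area_cm2  # dyn cm^-2
    shear_rate = sliding_speed_cm_s / film_thickness_cm          # s^-1
    return shear_stress / shear_rate                             # poise


# Viscosities quoted in the text (poise).
ETA_REFERENCE = 0.4   # the 0.4 poise reference value
ETA_ROLLING = 600.0   # effective viscosity at the rolling point
ETA_SLIDING = 14.0    # effective viscosity with 400 cm/s of sliding

# Rise attributed to pressure, and fall attributed to frictional heating.
pressure_factor = ETA_ROLLING / ETA_REFERENCE
heating_factor = ETA_ROLLING / ETA_SLIDING

# Hypothetical inputs, for illustration only (not measured values).
example_viscosity = effective_viscosity(
    traction_kgf=10.0,
    film_thickness_cm=1.0e-4,
    contact_area_cm2=0.1,
    sliding_speed_cm_s=400.0,
)
```

With the quoted figures the pressure factor comes to about 1500 and the heating factor to about 43, which is the 'fortyfold' fall.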


Fig. 9. Effective viscosity (poise) and film thickness as functions of sliding speed (cm s⁻¹). Full line, effective viscosity; interrupted line, film thickness.


From the work at high pressures of Bridgman and others it is known that 
the viscosities of oils increase under pressure. The high value of the effective
viscosity at the rolling point is due to this effect, whilst the fall upon the introduction
of sliding is due to frictional heating.


2.4 Theory 


There is a considerable theoretical counterpart to these experiments and 
explanations of the main results can be given. An approximate theory of film 
thickness based upon equation (1.1) and embracing the effects of pressure upon 
viscosity and the deformation of the discs has been developed by Grubin (1949). 
It predicts film thicknesses reasonably in accord with those deduced from 
measurements of capacitance. Recently with the aid of computers, more exact 
calculations have been made (Dowson and Higginson 1960, Archard, Gair and 
Hirst 1961) and these also agree reasonably with experiment. As a necessary 
simplification in the early development of theory these analyses do not embrace 
frictional heating and therefore apply only to conditions near the rolling point. 
However, by an approximate analysis of the heat balance within the oil film 
(Crook 1961) a further extension of theory has been made which explains 
many of the observations relating to sliding. The analysis confirms that the 




Fig. 10. The rise of temperature (°C) within the conjunction, from entry to recess. Full line, on the median plane of the oil film; broken line, at the surface of the disc.


temperature increments due to friction in pure rolling are negligible but shows 
that the temperatures rise rapidly as sliding is introduced. These temperatures
account satisfactorily for the fall in effective viscosity as sliding increases (fig. 9).
In fig. 10 typical calculations of the temperature on the median plane of the film
and at the surfaces of the discs are given. The temperatures plotted are with
reference to the bulk temperature of the discs. The temperature on the median
plane rises above that reference by 300°C. But despite that large rise in temperature
the theory shows that the oil nevertheless retains a viscosity sufficient to
maintain a parallel film. It shows in addition that the temperatures on the 
entry side before the Hertzian region are small. In this lies the explanation of 
why film thickness is insensitive to sliding (fig. 6). 

The fact that film thickness depends upon the viscosity of the oil at the 
surface temperature of the discs and not upon the viscosity of the oil as supplied 
by the jet requires comment. The apparent cause is that the oil which passes through
the conjunction, and which alone is responsible for the lubrication, is carried by
the discs as permanent skins with which oil from the jet does not merge. These
skins are naturally at the temperature of the discs (Crook 1957). 


3. WEAR 


The discs of the above experiments were necessarily polished to obtain 
electrical conditions as undisturbed as possible. However, with practical
surfaces solid-solid contacts through the film are frequent. At these encounters
protection of the surfaces depends upon the boundary lubrication afforded by 
active material in the oil physically or chemically adsorbed by the surfaces and 
in addition in some cases upon the protection afforded by chemical pre-treatments 
of the surfaces (e.g. phosphating). 

As ‘running in’ proceeds the solid-solid contacts become less frequent. 
This is illustrated by fig. 11a which shows the electrical resistance between two 
ground discs as a function of running time. The instantaneous resistance 
fluctuates wildly and it is the mean value averaged over the response time of the 
meter which is shown. Progress towards complete separation of the discs by a 




hydrodynamic film is shown by the rise in resistance with time. In fig. 11b
profilometer traces of a surface as ground and after ‘running in’ are compared
with the thickness of the hydrodynamic film. The roughnesses of the ‘run-in’
surface have been reduced by the running so that they can be accommodated
within the thickness of the film (Shotter 1958).


Fig. 11. ‘Running in’. (a) Electrical resistance (ohms) between discs as a function of time of
running (minutes); (b) Profilometer traces (sketched from originals). Left, before running in;
right, after running in with thickness of oil film indicated.


3.1 Pitting 


It is from the solid-solid contacts through the film that surface damage 
arises. With gear teeth the most common form of damage is the development 
of localized systems of cracks which spread and ultimately free flakes of material 
from the mass. The flakes fall away and leave pits (fig. 12). This phenomenon
has been studied extensively and has many features. But one is particularly 
germane to this article. It has long been known that polished discs are less 
susceptible to pitting than discs of rougher finish. Recently the relationship 
between pitting and surface finish has been studied systematically by Dawson 
(1961). He has demonstrated a strong negative correlation between revolutions 
to the onset of pitting and surface roughness when expressed in terms of the 
thickness of the hydrodynamic film. This is a noticeable example of the inter- 
relation of lubrication and wear; of the importance of knowing the state of 
lubrication in studies of wear. 




Fig. 12. A typical surface pit (scale mark: 1 mm).


3.2 Scuffing 


The results of fig. 11 were drawn from an instance of successful ‘running in’;
successful in that there was no catastrophe, although Dawson’s results would
suggest that the solid-solid contacts of the process would in the end give rise
to pitting. However, with an untried system running in is fraught with uncertainty
(the reader, interest heightened by a new car, should except his vehicle
as being a tried system). Suddenly conditions become too severe for boundary
lubrication to remain effective; the surfaces tear and the friction rises. This 
is known as scuffing and is due to local welding of the surfaces. It produces a 
great deal of subsurface deformation (fig. 13). 

Again there is a relationship with the thickness of the hydrodynamic film. 
This is shown, for example, by the fact that in experiments in which discs scuffed,
failure did not occur on starting the machine but was delayed by several minutes,
even up to fifteen minutes, although there was no change in the external conditions.
This behaviour is explained by the influence of disc temperature upon
film thickness. As the discs, due to the dissipation of frictional heat, become
hot, the viscosity of the oil of the skins falls and consequently the thickness of 
the hydrodynamic film also. Solid-solid contacts increase in severity and 
exacerbate the heating until the support of the hydrodynamic film no longer 
suffices to prevent welding. 

But scuffing depends also upon the surfaces themselves. For instance in 
some circumstances changing a combination of two discs of alloy steel to a 
combination of one disc of alloy steel and a disc of mild steel has eliminated 
scuffing. In addition chemical pre-treatments of the surfaces and the use of 
E.P. oils are often effective. 

Hypoid gears are an outstanding example in which mineral oils by themselves 
were inadequate and a necessity arose for boundary additives more effective than 
were then available. It was to overcome the scuffing of these gears that E.P. 
oils were originally developed. Perhaps because of the success with which 
additives were found they came to be regarded as the true lubricant and the 




Fig. 13. Subsurface deformation: (a) scuffed, (b) run in (scale mark: 0.1 mm). (Shotter 1958)


bulk oil was relegated to being a medium for them and for heat transfer. However,
from the work with discs and crossed cylinders and from tests with hypoid
axles an increasing probability of hydrodynamic action is being seen. If the
basic lubrication be hydrodynamic it then follows that the additives function 
with relation to the adventitious solid-solid contacts. 


3.3 Dry Wear 


It has been argued above that it is at solid-solid contacts where fluid lubrica- 
tion no longer persists that the crux of the problem of wear is to be found. It 
is therefore of value to study such encounters in isolation and this has been done 
with considerable success simply by rubbing materials together without fluid 
lubricant. In addition such experiments have the virtue of being directly 
relevant to circumstances where fluid lubrication cannot be tolerated, e.g. the
pressure vessels of gas-cooled reactors.




Here it is possible to select only one experiment for brief description. The 
wear of a pin of 0.12 per cent carbon steel pressed against a rotating ring of the
same material has been studied by Welsh (1957). The combined wear of both
members for two loads is plotted against time in fig. 14. At the lower load
(1 kg) the wear was proportional to time. At 3 kg the wear was initially also
proportional to time and, in keeping with preconception, proceeded at a greater rate. But after


Fig. 14. The combined wear of a pin and a ring of 0.12 per cent carbon steel as a function of time in minutes (Welsh 1957).


Fig. 15. The hard component in a worn ring of 0.34 per cent carbon steel (Welsh 1957).
(The photograph is of a taper section which gives an effective magnification greater in
depth than in the transverse direction. The component is in fact approximately
20 μ deep and 200 μ across.)




five minutes the wear rate fell dramatically. Eventually the paradox existed of 
the greater load having caused the lesser wear. 

Welsh has shown that the explanation lies in the changing nature of the 
surfaces. At the load of 3 kg the fall in wear rate is due to the wear process 
itself producing a hard component (fig. 15); the conditions at a load of 1 kg were 
too mild to effect such a transition. A similar hard component can also occur 
in lubricated systems; its presence in a scuffed disc is shown by the white 
material near the surface in fig. 13a.


4. CONCLUSION 


The author has explained some of the concepts used in work upon lubrica- 
tion and wear and has shown an application of physical principles to the work by 
describing in some detail one study aimed at a particular problem. The selection 
of material has been influenced by experience of research within the electrical 
industry. Different problems would have brought a different emphasis. But 
nevertheless if the reader has been put more in possession of some scientific 
ideas touching lubrication and wear, the article has made its point. 

In writing the article the author has drawn upon ideas which have emerged 
from continuous discussions over many years between members of a group led 
by Dr. W. Hirst. The author thanks Dr. T. E. Allibone, C.B.E., F.R.S., Director
of the Laboratory, for permission to publish this article. 


REFERENCES 


Archard, G. D., Gair, F. and Hirst, W., 1961, Proc. Roy. Soc. A, 262, 51.
Archard, J. F. and Kirk, M. T., 1961, Proc. Roy. Soc. A, 261, 532.
Barwell, F. T., 1956, Lubrication of Bearings (London: Butterworth).
Bowden, F. P. and Tabor, D., 1950, The Friction and Lubrication of Solids (Oxford: Univ.
Press).
Crook, A. W., 1957, Proc. Instn. Mech. Engrs. Lond., 171, 187.
Crook, A. W., 1958, Phil. Trans. A, 250, 387.
Crook, A. W., 1961, to be published in Phil. Trans. A.
Dawson, P. H., 1961 (private communication from Research Department A.E.I.,
Manchester).
Dowson, D. and Higginson, G. R., 1960, J. Mech. Engng. Sci., 2, 188.
Grubin, A. N., 1949, Central Scientific Research Institute for Technology and Mechanical
Engineering, Book No. 30, Moscow (D.S.I.R. Translation).
Shotter, B. A., 1958, Proc. Int. Conf. Gearing, Instn. Mech. Engrs. Lond., 120.
Welsh, N. C., 1957, J. Appl. Phys., 28, 960.


See also Proceedings of Conference in Lubrication and Wear, 1957, Instn. Mech. Engrs. 
Lond. 


The Author : 


After a Ph.D. degree for work on the optical properties of thin films, Dr. A. W. Crook
joined the Aldermaston Research Laboratories of Associated Electrical Industries Ltd in 
1948 where he has carried out fundamental studies related to some aspects of mechanical 
engineering. He has published papers on optics, the impact of metals, wear and lubrication 
and has been interested in the development of measuring instruments associated with the 
cutting of gears. He is a Fellow of the Institute of Physics. 


Two International Colour Meetings 


by R. W. BROCKLEBANK


Goethean Science Foundation, Clent, Stourbridge, Worcs. 


Two international Colour meetings held in May 1961 have given an opportunity
for scientists working in this field to meet and compare notes. The first
was the Maxwell Colour Centenary in London, organized by the Colour Group, 
with the Institute of Physics and Physical Society, and the Inter-Society Colour 
Council (of the U.S.A.), to celebrate the first demonstration of colour photo- 
graphy; it was deliberately kept informal as its primary purpose was an Anglo- 
American get-together, and its twenty-odd invited papers consisted mostly of 
reviews of progress in the field over the last hundred years. The second 
meeting was on a larger scale, held in Düsseldorf, and organized jointly by the
Fachnormenausschuss Farbe im Deutschen Normenausschuss (FNF) and the 
Centre d’Information de la Couleur (C.I.C.) of France, being the fifth in a 
series of Journées Internationales de la Couleur. More than eighty papers 
covered a range of topics from colorimetry and colour vision to problems of 
education and the use of colour in fashion, decoration, and advertising. 


THE MAXWELL CENTENARY


Rather concise versions of the Maxwell Centenary papers were preprinted 
in The Journal of Photographic Science (J. Phot. Sci. 9, No. 2, March-April
1961), but three of the papers were not included, and deserve mention. The
opening contribution was a paper sent by R. M. Evans, of the Eastman Kodak 
Company, Rochester, N.Y., describing in detail a reconstruction of Maxwell’s 
demonstration. This was based on particulars found in the notebook of
Thomas Sutton, Maxwell’s assistant, who actually carried out the photographic
part of the experiment, but involved also a good deal of detective work to fill in
missing details. Apparently Maxwell was not in the least interested in colour
photography as such, but was mainly concerned to show that all hues could be
produced by mixtures of just three colours, and that green, not yellow, was the 
best choice to go with red and blue for such a task. Thus he instructed Sutton 
to prepare blue, green, red and yellow separation prints of a colourful display of 
still life, without realizing that the photographic material at that time was 
absolutely insensitive to radiations in the green, yellow or red parts of the 
spectrum. In fact, Evans has found that the wet collodion with silver iodide 
preparation as used by Sutton is sensitive to short-wave radiation up to about 
430 mμ, where there is a sharp cut-off. Nevertheless, Sutton proceeded with
his task and made blue, green, yellow and red separation transparencies, using 
the appropriate coloured liquid filters, of an arrangement of coloured ribbons 
with a tartan design. The blue separation presented no difficulties, and Sutton
recorded an exposure of 6 sec in bright sunlight, using a copper sulphate
solution for the filter. The green separation, using a copper chloride solution
and an exposure of 12 min, was at first a complete blank, but a second attempt 
with a more dilute solution succeeded in producing an image. The yellow 
separation was made without difficulties, using a yellow glass filter, but then 
seems to drop out of the story. Lastly, the red separation, using a solution of 




iron sulphocyanide as a filter, was made with an exposure of eight minutes. 
Evans has reconstructed the liquid filters and measured their spectral transmission
characteristics. The blue solution passes short-wave radiations quite
adequately, while absorbing just those wavelengths to which the photographic 
emulsion was insensitive. The green solution, if not too concentrated, does 
transmit radiations to some extent just below 430 mμ in wavelength, so the
photographic plate could have recorded an image, and this would have a slight 
colour-difference from the blue separation, sufficient to produce a genuine 
colour-separation. But the transmission band of the red filter was definitely 
confined to longer visible wavelengths, and it is certain that the red radiations 
could not have produced a photographic image. Yet the image was there, and 
scrutiny of the original plates has shown that the difference between the red
separation and the others is genuine colour-separation, and cannot be attributed
to subsidiary effects such as contrast, density, or exposure. Evans has now 
produced a very satisfactory answer to this mystery. He has shown that the 
absorption bands in the red filter solution, and in all the fabric dyes most likely 
to have been used for dyeing the coloured ribbons, which give the red colour 
by blocking off the middle and short-wave radiations, fade out just about at the 
end of the visible spectrum. Thus, in sunlight, the red of the ribbons would
reflect ultra-violet radiation quite copiously, and this would be passed by the 
red filter and would be effective in producing a photographic image. The 
image so produced records the ultra-violet component of the scene, and in this 
case it happened to coincide quite well with the red component. The complete
picture was thus analysed in terms of blue, greenish blue, and ultra-violet
components, and these were projected in blue, green and red lights, respectively, 
for the re-synthesis. Since Vogel did not discover colour sensitizing dyes until 
1874, Evans concludes that Maxwell and Sutton invented three-colour photography
twenty years before it was possible. This does not mean that the Centenary
is spurious, because in principle Maxwell was perfectly correct, and to this day 
all successful methods of colour photography use a three-component analysis 
into short, middle and long wave regions, followed by a re-synthesis with three 
suitable colorants; unfortunately the muddle as to whether the proper companion 
for red and blue is green or yellow (or both) still continues. 

If Maxwell’s photography was a little suspect, there is no question about the 
excellence of his colorimetry. D. B. Judd, of the National Bureau of Standards, 
Washington, D.C., at the end of his review paper, remarked that Maxwell, 
although lacking the advantages of convenient analytical expressions and co- 
ordinate paper, produced in a few years (on what was merely an interesting 
sideline) a determination of spectral colour distribution that compares favourably 
with the best modern determinations. One interesting feature of Maxwell’s 
colour-matching methods was that he always worked with a white matching 
field, whereas the modern technique is to use a coloured field as highly saturated 
as possible. Comparison of these two methods under the same conditions has 
led to the discovery of what is virtually a new effect in colour vision by B. H. 
Crawford of the National Physical Laboratory, Teddington. In a paper
describing this discovery, Crawford summarized the two methods in a set of 
equations; in Maxwell’s method the white is first matched by a mixture of 
quantities l, m, n of the three chosen spectral primaries:

$$l[r] + m[g] + n[b] = W.$$

Secondly, a new spectral radiation is substituted for one of the primaries, and
a new match is made against the same white:

$$x[r] + y[g] + z[b'] = W.$$

Then the value of the new spectrum colour can be found in terms of the three
chosen primaries by substituting in the equation:

$$z[b'] = (l - x)[r] + (m - y)[g] + n[b].$$

The procedure is repeated for each spectral colour, always substituting for that
primary which will result in a triangle enclosing the white point. Naturally, if
the quantities l, x or m, y are large and nearly equal, the result, depending on their
differences, will be low in precision. Largely for this reason, the modern
method is to desaturate the spectral colour being measured with just enough of the
‘opposite’ primary for it to be matched with a mixture of the two ‘adjacent’
primaries:

$$z'[b'] + l'[r] = m'[g] + n'[b].$$

Thus the value of the spectrum colour is given immediately in terms of the three
chosen primaries:

$$z'[b'] = -l'[r] + m'[g] + n'[b].$$
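
If colour matches are strictly additive, the two procedures are different routes to the same three coefficients, and the following sketch checks this numerically. The stimuli are represented as vectors in an arbitrary linear three-dimensional space; the particular numbers chosen for [r], [g], [b], [b'] and W, and the helper names, are hypothetical and serve only to illustrate the algebra.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as a list of rows."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def match(stimuli, target):
    """Amounts of three stimuli whose additive mixture equals the target,
    found by Cramer's rule."""
    rows = [[stimuli[j][i] for j in range(3)] for i in range(3)]
    d = det3(rows)
    amounts = []
    for k in range(3):
        replaced = [row[:] for row in rows]
        for i in range(3):
            replaced[i][k] = target[i]
        amounts.append(det3(replaced) / d)
    return amounts


# Hypothetical stimuli (illustrative numbers, not measured data).
r = (1.0, 0.2, 0.0)
g = (0.3, 1.0, 0.1)
b = (0.1, 0.1, 1.0)
b_new = (0.05, 0.6, 0.9)   # the spectral colour [b'] to be specified
white = (1.0, 1.0, 1.0)

# Maxwell's method: two matches against the same white.
l, m, n = match([r, g, b], white)        # l[r] + m[g] + n[b] = W
x, y, z = match([r, g, b_new], white)    # x[r] + y[g] + z[b'] = W
maxwell = ((l - x) / z, (m - y) / z, n / z)

# Modern method: [b'] + l'[r] = m'[g] + n'[b], taking z' = 1.
coefficients = match([r, g, b], b_new)
l_prime, m_prime, n_prime = -coefficients[0], coefficients[1], coefficients[2]
modern = (-l_prime, m_prime, n_prime)
```

In this idealized calculation `maxwell` and `modern` agree to rounding error, so any systematic difference between the two sets of coefficients found with real observers would indicate a failure of additivity.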

Crawford’s apparatus was a revised version of the Wright colorimeter, and 
could be used to give absolute measures of spectral power as well as visual 
matches. With a 10° visual field, each of five observers made matches of points 
throughout the spectrum with each of the two field conditions, the Maxwellian 
white field and the saturated-colour field; the chosen three primaries were of 
course the same for each set of matches. The experimental results give
apparently more saturated points for spectrum colours from Maxwell’s conditions
than from the ‘modern’ conditions; the differences are greater for some
observers than for others, but are all in the same direction. One observer, with 
a wide divergence between the two sets of results, reported a certain difficulty 
caused by the appearance of the Maxwell spot, which was very obvious in the 
10° white field, whereas another observer, with a small divergence, saw no 
Maxwell spot; however the next observer got a large effect, but never saw a 
Maxwell spot. The conditions of observation—daylight illumination of 
high intensity, with the eye kept light-adapted—make it unlikely that rod-vision 
intrusion is responsible for the discrepancies. As yet it is impossible to say 
what might be the cause of the effect, though there is no doubt that it is a real 
one; it might be related to the discrepancies between colorimetric functions 
for 10° and 2° fields found by Stiles and Wyszecki (in which there was also a 
certain degree of difference in saturation between the two conditions) and there 
is some hope that a further investigation may disclose a factor or function which 
converts from white-field conditions to saturated-field conditions, so that 
Stiles’ results may be put to a practical use. Crawford’s first results have 
suggested many lines of investigation; some would concentrate on finding the 
cause of the effect, others would put the emphasis on extending the range of 
practical colorimetry to cover a wider set of conditions. Exact colorimetry is at 
present confined to the 2° field condition, and it depends absolutely on the 
additivity of colour matches; the discrepancy between white-field and saturated- 
field conditions is of course a break-down of additivity, but this discrepancy 
has not yet been shown for a small field. Even for the larger field, it is possible 
that it will be found that additivity holds to a near enough approximation over 




certain restricted areas of the colour domain (e.g. near-whites) for colorimetry to 
be a usable, if somewhat cumbersome, technique. In the other direction, it 
may turn out that the simplest kind of explanation for these and other similar
effects will demand that the spectral sensitivity of the receptors in the eye must
be changed if the conditions are changed, although, as Judd remarked when he 
drew this conclusion, this is no longer a popular idea. 

The third unscripted paper was given by W. A. H. Rushton, of Trinity 
College, Cambridge, in place of his review paper because what he really 
wanted to speak about was something else: an account of his latest results 
in providing a physiological basis for the trichromacy of vision. Rushton’s use 
of the ophthalmoscope for retinal densitometry is by now well known, and he has 
himself written an account of his work for the non-specialist quite recently 
(W. A. H. Rushton, “The Chemical Basis of Colour Vision”, New Scientist,
Vol. 10, No. 235, pp. 374-377, 18 May 1961), but it is worth reporting his 
two latest findings. A greater availability of deuteranopic subjects, together 
with a more careful interpretation of the results, has led him to retract his 
opinion that deuteranopes had two visual pigments active in the spectral region 
500-600 mμ; he now believes that his evidence shows them to have only one
pigment, the normal red-sensitive pigment, just as protanopes have only one
normal green-sensitive pigment active in this region. Secondly, by screening
150 medical students with a blue-yellow flicker device designed by Wright, he 
was able to select 8 with a good blue : yellow ratio, that is, with eyes more than 
usually free of yellow pigmentation and hence more sensitive to blue light; and 
with these subjects he has obtained some results which indicate the presence of a 
specific blue-sensitive pigment. Further, this pigment is absent from the centre 
of the fovea, which is known to be blue-blind. Thus Rushton has produced
objective evidence for the presence of three colour-sensitive pigments in the
retina, first conjectured by Thomas Young, though the names he has given them
(erythrolabe, chlorolabe and cyanolabe for the ‘red-taking’, ‘green-taking’ and
‘blue-taking’ pigments, respectively) will surely add to the muddle about the
three ‘proper’ primary colours.


DÜSSELDORF


In Düsseldorf, Dr. Judd opened the colorimetry session with a paper
describing the blue-glass filters that have been made to approximate the black-body
at 6500°K, when used with an incandescent lamp operated at a colour
temperature of 2854°K (CIE source A). The hope is evidently to provide a
more practical alternative to the existing CIE source C, which consists of the 
same lamp coupled with a pair of liquid filters, and to lead eventually to the re- 
definition of source C; indeed, such a proposal has already been put forward for 
consideration by the CIE Committee concerned, which has, after prolonged 
discussion, deferred its decision until 1963. The proposals came from the 
Russians, who have a filter that they believe to be adequate for the purpose, a 
three-layer blue-glass filter of the Ronis type, as described by Rautian, 
Lobanova and Znamenskaja in 1956. Judd attempted to purchase such filters 
in 1959, but found that they were not commercially available; at his request, the 
Corning Glass Works, New York, undertook to design and produce a filter of 
this type, under the direction of A. J. Werner, and at the end of 1960 delivered 




a supply of them. Measurements at the National Bureau of Standards have 
shown that the Corning three-layer filter more nearly duplicates the black-body 
energy distribution for 6500°K than do the data reported for the Ronis-type
filters. The Corning filters are available for purchase (colour specification
number CS-1-71), and so can be used by any laboratory anxious to have simulated
6500°K full radiators. Before any decision to alter the CIE specifications
is made, there are a number of points to settle. Firstly, is the Planckian 
distribution really the target to aim at? Since the colorimetry of materials is a 
practical problem, the need is for a standard illuminant approximating to 
practical illuminants. Source A is in fact an actual incandescent lamp of the 
type widely used for artificial lighting. Sources B and C were originally 
suggested as alternatives for a standard type representing daylight, B correspond- 
ing to the colour of direct sunlight, while C included the contribution of blue 
sky, and both used filters which attempted to follow the relative energy distri- 
bution of these natural sources. This could still be the target. Secondly, if a 
change is to be made, should it not also include provision for factors that have 
recently come to be important in colorimetry, though not considered when 
these standard illuminants were specified in 1931? One of these factors concerns 
the colorimetry of fluorescent materials; it would be an obvious advantage if the 
same standard illuminant could be used for all surface colours, but with 
fluorescent substances the ultra-violet radiation of the source has also to be 
standardized. An incandescent lamp is unsatisfactory as a source of ultra-violet 
radiation, but there is a promising alternative in the xenon arc lamp, which has 
in general a very good spectral distribution. Certain objections to the xenon 
arc, such as its instability and possible danger factors, might yield to develop- 
ment, and there is hope that if a quick decision in favour of the Russian 
proposal can be avoided, a new standard illuminant could emerge with very 
wide applicability and a high practical value. It would seem that although in 
1931 failure to agree led to the adoption of two ‘ natural-colour’ illuminants 
instead of one, the new standard could stand alone, and allow source B to fade 
out of use, with the prospect of greater ease in comparing data, at present 
hindered by the fact that some experiments are done with one illuminant and 
others with the other. 
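
The filter problem discussed above can be stated compactly if the Planckian distribution is taken as the target, which, as noted, is itself open to question. Under that assumption the ideal relative transmittance of the filter at each wavelength is the ratio of the black-body spectral radiance at 6500°K to that at 2854°K. The sketch below evaluates this ratio with Planck's law; the function names and sampled wavelengths are hypothetical, modern values of the physical constants are assumed, and the result is normalised to its largest value in the sampled range, since only the relative distribution matters.

```python
import math

H = 6.62607015e-34   # Planck constant, J s
C = 2.99792458e8     # speed of light, m/s
K = 1.380649e-23     # Boltzmann constant, J/K


def planck(wavelength_m, temperature_k):
    """Black-body spectral radiance per unit wavelength (W sr^-1 m^-3)."""
    x = H * C / (wavelength_m * K * temperature_k)
    return 2.0 * H * C ** 2 / (wavelength_m ** 5 * math.expm1(x))


def ideal_filter_transmittance(wavelengths_nm, t_lamp=2854.0, t_target=6500.0):
    """Relative transmittance that would convert a Planckian lamp at t_lamp
    into a Planckian distribution at t_target, normalised to a peak of 1."""
    ratios = [planck(w * 1e-9, t_target) / planck(w * 1e-9, t_lamp)
              for w in wavelengths_nm]
    peak = max(ratios)
    return [ratio / peak for ratio in ratios]


# Sample the visible spectrum every 50 nm from 400 to 700 nm.
wavelengths = list(range(400, 701, 50))
transmittance = ideal_filter_transmittance(wavelengths)
```

The required transmittance falls steadily from the blue end of the spectrum to the red, which is why a blue filter is needed.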

There is today a widespread tendency to regard the concept of colour as
belonging more to the sphere of perception and rather less rigidly to selectivity 
among radiations, but perhaps few would go as far as Professor V. Ronchi of 
Florence, who argued that light and colour exist only in the psyche, and that 
material bodies have neither, therefore colour cannot itself be the cause of 
psychological emotions. It may be that his attack was not as devastating as it 
appeared, and that he was only asking for a revision of terminology. The fact 
is that colour perceptions, however subjective they may be, are not at all arbi- 
trarily related to the physical radiations responsible for light, and it is quite 
reasonable to refer to the colour of lights or the colour of objects, even though 
it may be thought that electro-magnetic disturbances and atomic clouds are all 
that is ‘really there’. After all, nobody objects to the use of concepts of length 
or size in physics, yet these too are derived in the first place from subjective 
perceptions. One still finds confusions between metric size and angular size 
in certain circles, for instance in the argument whether the moon is as big as a 
sixpence or a dinner-plate, though the concepts are clear enough if applied 




correctly. Similarly in the field of colour, the concept of colour as a characteris- 
tic of visible radiations (as used in colorimetry) is distinct from the concept of 
colour as an aspect of visual experience (as used in experimental psychology) 
while still being related to it, though in a more complex fashion than the relation 
between metric and angular sizes. It may be that one psychological effect 
cannot itself cause another psychological effect, but there is no reason to suppose 
that colour, in the sense of a characteristic radiant flux, cannot cause psycho- 
logical effects other than the obvious one of a colour sensation. In this 
connexion, a paper given by R. M. Hanes, of Johns Hopkins University, Maryland,
is of interest. He spoke on “Fact and Fancy Concerning Derived Reactions
to Colour”, and noted that facts in this area were particularly scarce.
These derived reactions are cases where the subjective impression includes 
effects outside the sphere of colour, yet apparently produced only by the colour 
of the stimulus. The best known case is that of warmth and coolness; this is 
sufficiently well attested to rate as a fact. It is the hue of a colour, more than 
its brightness or saturation, that contributes to impressions of warmth or 
coolness, with a fairly clear indication that orange is the warmest hue while the 
coolest ranges through the blues and greens. Other such reactions about which 
there is general agreement indicate that dark colours appear more ‘ heavy’ 
than light colours, and also that lighter objects look larger than darker objects 
of the same dimensions. Hanes reported a large-scale experiment that was 
designed to investigate the effect of colour on apparent distance. He showed 
that though the familiar belief that red surfaces ‘ advance’ and blue surfaces 
‘recede’ may often be true, it is likely that the effect is due to the luminance of 
the surface rather than its hue. He constructed a full-size room, 12 ft by 22 ft 
and 8 ft high, with the end walls fitted on rollers and a smooth luminous ceiling. 
The subject sat in an armchair in the middle of the room facing one long wall; 
this was painted a medium grey, as was the other long wall and one of the end 
walls, to one side of him. The other end wall was painted in turn with seven 
other colours, including black and white, and in each case the end walls were 
moved until the subject reported that they both seemed to be the same distance 
away. Different groups of observers were given different instructions, but it was 
found to be a fact that colour did produce an effect. All the colours advanced 
except black, and the amount of advance was proportional to the luminous 
reflectance of the colour. It seems that many of the popularly accepted beliefs 
about these derived reactions are generalizations based on a few cases, and that 
further strict investigation would reveal underlying facts, not necessarily 
identical with the popular belief, but supporting particular instances of it. Such 
studies cannot uncover the origin of these reactions, which may be learned 
associations, or perhaps a result of ‘confusion’ in the brain between very 
similar patterns of nerve activity stimulated by separate sense organs; but for 
practical problems of using colour for decoration or publicity, the mechanism 
of the reaction is of no account—what matters is whether it really is a fact or not. 
There is evidently plenty of scope for further carefully controlled experiments 
to investigate the many psychological effects connected with colour. 


The Laws of Electromagnetism and an Electrostatic Analogue 


by J. ALAN CHALMERS 


University of Durham 


SUMMARY 


Starting from the idea of a current-carrying coil as a magnetic ‘ dipotential ’, i.e. a 
dipole of constant potential difference, the relationships between the electromagnetic laws 
of Ampère, Biot-Savart, Grassmann and Faraday are discussed. 


1. INTRODUCTION 


Different text-books on electricity and magnetism have used various methods 
of relating the laws of electromagnetism, but there remain difficulties whichever 
method is adopted. The present work is an attempt to put elementary work 
on a more logical basis without elaborate mathematics, by a greater use of the 
concept of magnetic potential. While it is not claimed that anything funda- 
mentally new will be found here, it is hoped that the ideas will be useful for 
teaching. 

There are three fundamental phenomena of electromagnetism: 

(a) the force acting on a moving charged particle, or on a wire carrying a 
current; 

(b) the magnetic field produced by a moving charged particle, or by a wire 
carrying a current; a combination of (a) and (b) gives the force between wires 
carrying currents; and 

(c) the induced E.M.F. produced in a conductor when there is a changing 
magnetic field. 

If these three phenomena were entirely independent, we should 
expect that there would be independent constants involved in each of the formulae 
concerned, just as electrostatic and gravitational forces between two 
bodies involve the two constants $\epsilon_0$ and G respectively. But, in fact, the laws of 
electromagnetism involve only the single constant $\mu_0$. 

As long as the medium in which the charges or currents are situated remains 
the same, matters are comparatively simple; but once we compare effects with 
different media, complications arise. In an earlier work (Chalmers 1961), 
the author showed that the correspondence of the vector H to E, and so of B to 
D, can be maintained if a current-carrying coil is considered to behave as a 
‘ dipotential ’, i.e. a dipole of constant potential difference, rather than as an 
ordinary dipole of constant charge (the term ‘ constant’ referring to effects of 
change of medium); this same idea will be used in the present work. This 
correspondence between H and E can be conveniently expressed by the idea 
that the unit of magnetic potential is the ampere, while that of electrostatic 
potential is the volt; thus analogies to electrostatic relations can be found by 
replacing E in volts/metre by H in amps/metre. 

The following treatment is not complete, in that familiar arguments (to be 
found in text-books) are only quoted, whereas in a complete logical sequence 
they would have to be given in full. 




2. THE MAGNETIC DIPOLE 


We shall here take as the fundamental basis of electromagnetism the law 
which gives the force exerted on a moving charged particle in a magnetic field. 
This can be expressed: 

$$\mathbf{F} = q(\mathbf{v} \times \mathbf{B})$$

where F is the force acting on a charge q moving with a velocity v. This 
equation defines the magnetic induction vector B. 

Alternatively, 

$$\mathbf{F} = i(\mathbf{s} \times \mathbf{B})$$

where F is now the force acting on an element s carrying a current i. 

Considering a plane loop of area A carrying a current i and situated so that 
its normal makes an angle $\theta$ with the direction of B, it is well known that there 
is a couple G tending to alter $\theta$ of magnitude: 

$$G = iAB\sin\theta.$$

Now, in an electrostatic field of strength E, an electric dipole of ‘electric 
moment’ M, situated with its axis making an angle $\theta$ with the direction of E, 
experiences a couple 

$$G' = ME\sin\theta.$$

The magnetic vector corresponding to the electric vector E is H, so we must 
replace B by $\mu H$; so 

$$G = \mu iAH\sin\theta.$$

Thus the current-carrying loop behaves as a ‘magnetic dipole’ with its 
axis normal to the loop and of ‘magnetic moment’ $m = \mu iA$. Since the magnetic 
moment contains the factor $\mu$, it is altered on change of medium and it is, in 
fact, a ‘dipotential’. 

In previous discussions, it has been found sufficient to deduce m, which 
gives the couple on the loop. But, just as an electric dipole consists of charges 
$\pm Q$ separated by a distance l, where $M = Ql$, so a magnetic dipole could be 
considered to consist of magnetic poles of strength p, separated by a distance 
d so that 

$$m = pd.$$

This division of m into p and d is logical for a permanent magnet but when 
we try to apply it to the intangible ‘magnetic shell’ which behaves like a current-carrying 
coil, it is not possible to give any immediate interpretation of the 
distance d and it has been usual to think of d as infinitesimally small, while m 
retains the finite value derived from the couple on the loop. 

From the electrostatic analogy, p must represent the total number of lines 
of B which emerge from the loop. As will be discussed later, this is a measurable 
quantity since, if $p = Li$, L is the self-inductance of the loop. We may note, for 
later reference, that $L = \mu A/d$ and so is independent of i and proportional to $\mu$. 

If we consider m to be factorized into p and d, it is reasonable to assume that 
it is p and not d which alters as the current alters, since d is a geometrical length 
which depends on the geometry of the circuit. This assumption is verified by 
the experimental fact that L, the self-inductance, is a constant for any one circuit, 
independent of i. Thus $p = Li$ is justifiable. 

If we have an electric dipole, with charges $\pm Q$ covering an area A and 
separated by a distance l, it can be considered as a parallel-plate condenser. 




So we can consider our magnetic shell as analogous to a parallel-plate condenser 
of area A and separation d. 


3. AMPERE’S LAW 


If a current-carrying loop behaves as a magnetic dipole as regards the force 
or couple experienced by it in a magnetic field, then it is a reasonable assumption 
to make that it will also behave as a magnetic dipole as regards the magnetic 
field produced by it. 

In the electrostatic case, for a parallel-plate condenser carrying charges 
$\pm Q$ on plates of area A with separation d, the potential difference between the 
plates is: 

$$V_1 - V_2 = \frac{Qd}{\epsilon A}.$$

Applying the same result to the analogous case of the magnetic shell, for the 
difference in magnetic potential $\psi$ we have 

$$\psi_1 - \psi_2 = \frac{pd}{\mu A}.$$

But 

$$pd = m = \mu iA$$

so 

$$\psi_1 - \psi_2 = i.$$


For an electrostatic parallel-plate condenser, the potential difference is the 
same whether we move by a path straight across the condenser or by an external 
path (see Fig. 1), and, analogously, the same would be true for a magnetic 


Fig. 1. Potential difference across a parallel-plate condenser. 


shell composed of magnetic material or a magnetic ‘condenser’ with two 
sheets of opposite magnetic poles. But with the magnetic shell that is equivalent 
to a current-carrying coil, there is no material present. Since $d\psi/dx = -H$, 
which is finite, $\delta\psi$ for the small distance d must be small. Thus there can be 
little difference of potential if we go from one ‘side’ of the shell to the other 
through the shell; but if we go round outside the shell, we get the value $\psi_1 - \psi_2 = i$ 
as given above. Completing the path through the shell, we get the total change 
of potential going round a path encircling a current equal to i. Further 
discussion of the distinction between this case of a many-valued potential and 
the simpler one of a single-valued potential is given in many text-books. 




Electrostatically, 

$$V_1 - V_2 = \int_1^2 E_s\,ds$$

and so, similarly 

$$\psi_1 - \psi_2 = \int_1^2 H_s\,ds.$$

Taking the integration right round the path which encircles a current i we 
obtain Ampère’s law, 

$$\oint H_s\,ds = i.$$

We see that Ampère’s law is derived here directly from the equivalence of a 
current-carrying loop with a dipotential and not, as in many text-books, either 
given as a result without proof, or obtained as a generalization of results from 
the Biot-Savart law, itself assumed without proof. 

It may be noted that, if we consider the magnetic potential to be the important 
quantity, then H is merely a convenient way of describing $-\operatorname{grad}\psi$. 
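
The many-valued character of the magnetic potential can be illustrated numerically. The following is a minimal sketch, not part of the original argument: it assumes the textbook field of a long straight wire, $H = i/2\pi r$ directed azimuthally, and the function names and the rectangular paths are chosen purely for illustration. The line integral of H round a path that encircles the wire should come out close to i, and round a path that does not encircle it, close to zero.

```python
import math

def h_field(x, y, current):
    """H (A/m) at (x, y) for a long straight wire along z through the origin.

    Assumed textbook result: magnitude i/(2*pi*r), azimuthal direction.
    """
    r2 = x * x + y * y
    factor = current / (2.0 * math.pi * r2)
    return (-factor * y, factor * x)

def circulation(corners, current, steps_per_side=2000):
    """Midpoint-rule estimate of the line integral of H . ds round a closed polygon."""
    total = 0.0
    n = len(corners)
    for k in range(n):
        (x0, y0), (x1, y1) = corners[k], corners[(k + 1) % n]
        dx = (x1 - x0) / steps_per_side
        dy = (y1 - y0) / steps_per_side
        for j in range(steps_per_side):
            xm = x0 + (j + 0.5) * dx
            ym = y0 + (j + 0.5) * dy
            hx, hy = h_field(xm, ym, current)
            total += hx * dx + hy * dy
    return total

# Illustrative paths (anticlockwise), lengths in metres, current in amperes.
encircling = [(-1.0, -0.5), (2.0, -0.5), (2.0, 1.5), (-1.0, 1.5)]
not_encircling = [(1.0, 1.0), (3.0, 1.0), (3.0, 2.0), (1.0, 2.0)]

around = circulation(encircling, current=2.5)       # close to the current, 2.5
outside = circulation(not_encircling, current=2.5)  # close to zero
```

The result does not depend on the shape of the path, only on whether the current is encircled, which is the content of Ampère's law.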


4. THE BIOT-SAVART LAW 


Although Ampère’s law is a simple expression giving the field due to a 
current-carrying conductor, it can be applied directly in only a few simple cases. 


Fig. 2. Field due to straight wire. 


For more general use, we require a formula which gives the contribution to the 
field due to a current element, so that we can add vectorially the effects for all 
current elements concerned. 

If a current-carrying loop behaves as a dipotential, we can consider the field 
produced by it at a distant point in the plane of the loop. We know, from simple 
magnetostatics, that the field at a distant point perpendicular to the axis of a 
magnet is parallel to the axis (Gauss position B). Now, dH due to an element 
of the loop can depend only on dl, the length of the element, i, the current in the 
element, and r, the line joining the element to the point in question. Both dl and 
r lie in the plane of the loop and H is perpendicular to this plane, so it appears 
likely that a vector-product law may be the relation. We therefore try: 

$$d\mathbf{H} = Ki(d\mathbf{l} \times \mathbf{r})$$

where K is a scalar factor which involves the distance r but not i or dl. 

It would be possible, at this stage, to verify this result and obtain a value for 
K if we apply this to a suitably shaped loop, e.g. a rectangle, calculate H for the 
distant point and compare it with the result for the Gauss position B, but a 
simpler method is available. 

Consider a long straight wire carrying a current i (see fig. 2). At a point P, 
distant y from the wire, H must be perpendicular to the plane of the diagram 
and by going round a circle of radius y, $\oint H_s\,ds = 2\pi yH = i$ from Ampère’s law, so that 

$$H = i/2\pi y.$$

For the element dx, the contribution to H is: 

$$dH = Ki\,dx\,r\cos\theta;$$

now, 

$$dx = r\sec\theta\,d\theta \quad\text{and}\quad y = r\cos\theta.$$

Thus H contains a factor 1/r and dH a factor $Kr^2$; so, to give consistency, K 
must contain a factor $1/r^3$. 

Putting $K = k/r^3$ and integrating dH to give H, it is easily shown that $k = 1/4\pi$. 
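
The value of k can be checked numerically. The sketch below is illustrative only: it sums the element contributions $dH = k\,i\,dx\,r\cos\theta/r^3$ along a wire that is long but finite (an assumption standing in for the infinite wire), with $k = 1/4\pi$, and compares the total with the Ampère result $i/2\pi y$. The function names and the chosen dimensions are hypothetical.

```python
import math

def h_from_elements(current, y, half_length, n=200000):
    """Sum dH = k*i*dx*r*cos(theta)/r**3 over a straight wire, with k = 1/(4*pi).

    The wire runs from -half_length to +half_length along x; the field point
    is at perpendicular distance y from its mid-point.
    """
    k = 1.0 / (4.0 * math.pi)
    dx = 2.0 * half_length / n
    total = 0.0
    for j in range(n):
        x = -half_length + (j + 0.5) * dx   # mid-point of the element
        r = math.hypot(x, y)                # distance from element to P
        cos_theta = y / r                   # from y = r cos(theta)
        total += k * current * dx * r * cos_theta / r**3
    return total

def h_from_ampere(current, y):
    """Field of an infinite straight wire from Ampere's law, H = i/(2*pi*y)."""
    return current / (2.0 * math.pi * y)

# Illustrative values: 2 A, field point 0.1 m from a wire 200 m long.
summed = h_from_elements(2.0, 0.1, 100.0)
exact = h_from_ampere(2.0, 0.1)
relative_error = abs(summed - exact) / exact
```

With the wire a thousand times longer than the distance y, the two values agree closely, as the choice $k = 1/4\pi$ requires.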

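Thus 

$$d\mathbf{H} = \frac{i}{4\pi r^3}(d\mathbf{l} \times \mathbf{r}).$$

This gives the Biot-Savart law derived directly from the equivalence of a 
current-carrying loop with a dipotential, together with the assumption, or 
inspired guess, of a vector-product law; in most books, the Biot-Savart law 
is assumed without any definite basis. 

From the basic law in the form 

$$\mathbf{F} = i(\mathbf{s} \times \mathbf{B})$$

we can get the force between two current elements as Grassmann’s formula: 

$$d\mathbf{F} = \frac{\mu i_1 i_2}{4\pi r^3}\,\bigl(d\mathbf{l}_2 \times (d\mathbf{l}_1 \times \mathbf{r})\bigr),$$

for the force on element 2, carrying current $i_2$, due to element 1, carrying 
current $i_1$, with r drawn from element 1 to element 2. 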


Because of difficulties in relation to Newton’s third law of motion, other 
forms of this relation have been suggested and these would require different 
laws in place of the Biot—Savart law. However, these difficulties are surmount- 
able (Page and Adams 1945) and there is no need to consider such other forms, 
which would not involve a vector-product law. 
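
The difficulty with Newton's third law can be seen in a simple numerical case. The sketch below is illustrative only: it evaluates the Grassmann expression, taken here as $\mu i_1 i_2\,(d\mathbf{l}_2 \times (d\mathbf{l}_1 \times \mathbf{r}))/4\pi r^3$ for the force on element 2 with r drawn from element 1 to element 2, for two mutually perpendicular elements. The function names, the element lengths and the value used for $\mu_0$ are assumptions made for the example.

```python
import math

MU_0 = 4e-7 * math.pi  # assumed permeability of free space, H/m

def cross(a, b):
    """Vector product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def grassmann_force(i_src, dl_src, pos_src, i_tgt, dl_tgt, pos_tgt, mu=MU_0):
    """Force on the target element due to the source element (Grassmann form)."""
    r = tuple(t - s for s, t in zip(pos_src, pos_tgt))  # source -> target
    dist = math.sqrt(sum(c * c for c in r))
    scale = mu * i_src * i_tgt / (4.0 * math.pi * dist**3)
    return tuple(scale * c for c in cross(dl_tgt, cross(dl_src, r)))

# Element 1 at the origin pointing along x; element 2 at x = 1 m pointing along y.
dl1, pos1 = (1e-3, 0.0, 0.0), (0.0, 0.0, 0.0)
dl2, pos2 = (0.0, 1e-3, 0.0), (1.0, 0.0, 0.0)

force_on_2 = grassmann_force(1.0, dl1, pos1, 1.0, dl2, pos2)
force_on_1 = grassmann_force(1.0, dl2, pos2, 1.0, dl1, pos1)
```

Here element 2 lies on the line of element 1, so element 1 produces no field at element 2 and the force on element 2 vanishes, while the force on element 1 does not. The two element forces are therefore not equal and opposite; the imbalance disappears only when the integration is taken round complete circuits.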


5. LENZ’S LAW 


It may be taken as an experimental fact that an induced E.M.F. is produced 
in a circuit while the total magnetic induction through the circuit is being altered, 
whether the change of induction is produced by a changing current in the same 
or another circuit, by relative motion of two circuits, or by motion of a permanent 
magnet. Lenz’s law, which may be derived from Le Chatelier’s principle, 
states that the direction of the induced current is such as to produce magnetic 
induction in a direction tending to keep the total magnetic induction through 
the circuit the same, opposing the change which is being produced by the chang- 
ing current or by the relative motion. 


6. FARADAY’S LAW 


We shall first consider the case of the induced E.M.F. produced by a changing 
current in the circuit concerned. 

In an electrostatic parallel-plate condenser, the energy is $\tfrac{1}{2}QV$. 

For the corresponding magnetic shell, p replaces Q and i replaces V, so the 
energy is: 

$$W = \tfrac{1}{2}pi = \tfrac{1}{2}Li^2$$

since $p = Li$. 

The rate of change of energy is 

$$\frac{dW}{dt} = Li\frac{di}{dt}.$$

But, if we have a circuit with a current i, the rate of working is Ei, where 
E is the E.M.F. in the circuit. 

Thus we get E corresponding with $L\,di/dt$ and, from Lenz’s law, $E = -L\,di/dt$. 
As we have already seen (§ 2), p represents the number of lines of induction, N, 
emerging from the loop and so, if L remains constant: 

$$\frac{dN}{dt} = \frac{dp}{dt} = L\frac{di}{dt}.$$

So we get $E = -dN/dt$, Faraday’s law for this case of self-induction, deduced 
in a way which is more simple than that usually given. 

In order to extend this result to cases where the lines of induction are due, 
not to the circuit in which the induced E.M.F. is produced, but to another circuit 
or to a permanent magnet, we consider a dipole of moment m in a field H. It is 
easy to show that the potential energy is 


$$W = mH.$$


But $m = \mu iA$ and, if $N = BA$, the number of lines of induction through the 
circuit, 

$$W = iN.$$

If N is altered but i remains constant momentarily: 

$$\frac{dW}{dt} = i\frac{dN}{dt}$$

giving, as before: 

$$E = -\frac{dN}{dt}.$$




Returning to the single loop, we see that m is easily obtained from the 
couple on the loop. If the self-inductance, L, can be measured then we have: 

$$d = \frac{m}{p} = \frac{\mu iA}{Li} = \frac{\mu A}{L}$$

and hence it is possible to obtain a value for the ‘length’ of the dipole. It is 
easy to verify that, taking into account the number of turns, the well-known 
formula for the self-inductance of a long solenoid gives the ‘length’ of the 
dipole equal to the length of the solenoid. 
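
The statement about the long solenoid can be checked with a short calculation. The sketch below rests on two assumptions that are not spelled out in the text: that ‘taking into account the number of turns’ means using the moment of all N turns, $m = \mu NiA$, together with a pole strength equal to the flux through the coil, $p = Li/N$; and that the well-known formula referred to is $L = \mu N^2 A/l$. The function names and the numerical values are hypothetical.

```python
import math

MU_0 = 4e-7 * math.pi  # assumed permeability of free space, H/m

def solenoid_inductance(turns, area, length, mu=MU_0):
    """Long-solenoid formula L = mu * N**2 * A / l (end effects neglected)."""
    return mu * turns**2 * area / length

def dipole_length(turns, area, inductance, current=1.0, mu=MU_0):
    """'Length' d = m / p of the equivalent dipole.

    Assumed reading of the text: m = mu*N*i*A for N turns, and the pole
    strength p is the flux through the coil, L*i/N.
    """
    moment = mu * turns * current * area
    pole_strength = inductance * current / turns
    return moment / pole_strength

# Illustrative solenoid: 500 turns, 1 cm^2 cross-section, 20 cm long.
L_sol = solenoid_inductance(turns=500, area=1e-4, length=0.2)
d = dipole_length(turns=500, area=1e-4, inductance=L_sol)
```

On these assumptions d comes out equal to the 0.2 m length of the solenoid, independently of the current and of the permeability.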


7. THE EFFECT OF THE MEDIUM 


When the medium in which electromagnetic effects occur is altered, there 
is a change in the ratio of B to H and it is therefore important to consider which 
of these enters into the formulae. It is always stated, without any further 
discussion, that the force on a moving charge, or on a current, is proportional 
to B rather than to H. But this is not susceptible of direct proof since we do 
not know of any medium, through which a charge or wire can move, 
which has a permeability sufficiently different from that of a vacuum for a test. 

If we attempt to consider the consequences of the assumption that the force 
would be proportional to H, not B, when the medium is changed, we could 
write: 

F=q(v x H). 


This would give the wrong units for H and so we should then have to correct 
this by inserting a constant of the appropriate dimensions, which is in fact the 
constant $\mu_0$. 

Following the same argument as above, we would get: 

$$\oint B_s\,ds = \mu_0 i.$$

The loop would then behave as a true dipole, not as a dipotential and 
Ampère’s law would no longer be true. 

We would also have: 

$$m = pd = Lid = iA\mu_0,$$

so that 

$$L = \frac{\mu_0 A}{d}$$

and L would be independent of the medium. It is possible to measure the 
value of L without requiring any motion in the medium and it is well-known 
that L increases with the permeability of the medium. Working back from this, 
we see that the force on a moving charge must be proportional to B, not to H. 


8. CORRESPONDENCE BETWEEN THE VECTORS 


The above has all been written from the point of view, discussed in the 
earlier paper, that H is analogous to E and B to D. Several writers have taken 
the opposite view and it is therefore worth while to consider how that view would 
fit into the present discussion. 

With B corresponding to E, and a loop experiencing a couple $G = iAB\sin\theta$, 
the loop behaves as an ordinary dipole of moment $m' = iA$. 

If we now put $m' = p'd$ and $p' = L'i$, we have $L'i$ equal to the number of lines 
of H. 

Using again the analogy of the parallel-plate condenser, we get: 

$$\psi' = \frac{p'd\mu}{A}$$

since now $\epsilon$ is replaced by $1/\mu$. 
Thus 

$$\psi' = \mu i.$$

But $\psi'$ is now $\oint B_s\,ds$ 
and so 

$$\oint H_s\,ds = i.$$

Thus Ampère’s law is derived, although not so simply as before. The later 
parts of the argument, in the same way, can also be derived from considering 
B to correspond to E, but as with Ampère’s law, the results do not arise so 
simply. 

One can conclude that, as far as the present discussion goes, it is not wrong 
to consider B as analogous to E, but it is far less convenient than to take H to 
correspond to E, with the idea of the current-carrying coil as a dipotential to 
obviate the difficulties; the present discussion therefore adds to those given in 
Chalmers (1961) that the correspondence of H with E and of B with D is the most 
convenient concept. 


REFERENCES 


CHALMERS, J. A., 1961, The Correspondence between a Current-carrying Coil and a 
Magnetic Dipole, Contemp. Phys., 2, 472. 
PAGE, L., and ADAMS, N. I., 1945, Amer. J. Phys., 13, 141. 


The Author: 
Dr. Alan Chalmers, M.A., Ph.D., F.Inst.P., is Reader in Physics in the Durham 
Colleges in the University of Durham. 


Faraday’s Law of Induction and the Force on a Body due to 
Change in its Magnetization in an Electric Field 


by K. G. RAMANATHAN 
National Physical Laboratory, New Delhi, India 


SUMMARY 


From arguments based on Faraday’s Law of induction, it is shown 
that a body whose intensity of magnetization is changing should, in general, 
experience a force when situated in an electrostatic field in a manner analogous 
to the force experienced by a conductor carrying an electric current in a 
magnetic field. 


An infinitely long bar of magnetic material whose intensity of magnetization 
is changing may be pictured as the carrier of a magnetic displacement current 
$I_m$, which denotes the rate of displacement of magnetic poles across any section 
of the bar (fig. 1b). Faraday’s law of induction may then be interpreted as the 
creation of an electric field E by the magnetic current $I_m$. In this way the 
analogy with Oersted’s discovery of the creation of a magnetic field H by an 
electric current $I_e$ is demonstrated better (fig. 1a). The concept $I_m$ introduced 
here is inherent in Maxwell’s theory and has been defined by many writers, 
Drude (1900) and more recently Wilson (1950) and Schelkunoff (1947) to 
quote only a few, as 


$$I_m = \frac{S}{4\pi}\,\frac{\partial B}{\partial t} \qquad (1)$$

where S is the area of cross section of the bar and B is its magnetic induction at 
any instant of time t. 

To show that this interpretation is useful we shall replace the rate of change 
of flux in eqn. (1) by the induced electromotive force in any closed path surrounding 
the rod. For a circular path of radius R, around the infinitely extending 
bar as axis, the electromotive force is $2\pi RE$, in terms of the electric field E. 
Substitution of this in (1) leads to the relation $E = 2I_m/R$, the relative directions 
of E and $I_m$ being as shown in the figure. This is analogous to the familiar 
relation $H = 2I_e/R$. The more general expression for the electric field due to a 
magnetic current element $I_m\,dl$, at a point whose direction is inclined to that 
of the element dl by an angle $\theta$, can also be deduced as 

$$dE = \frac{I_m\,dl\,\sin\theta}{R^2}. \qquad (2)$$

Further, we may define an electromagnetic unit of magnetic current as the current 
which while flowing through an arc of unit length of a circle of unit radius 
produces at its centre an electric field of one electrostatic V/cm. This corresponds 
to a displacement rate of $3 \times 10^{10}$ magnetostatic units/sec. 

The fact that the concept of magnetic displacement current is useful in 
giving a new interpretation of Faraday’s law of induction, suggests the possibility 
that it might be useful also in interpreting other electromagnetic processes 
involving the basic phenomenon of induction and also in predicting effects 
analogous to those observed with electric currents. One of the important 
effects due to an electric current $I_e$ is the force $F_e = HI_el$ experienced by a 
conductor of length l when it is situated perpendicular to a magnetic field H, 
as in fig. 2a. This force, which is caused by the variation in the Maxwell 
stresses around the conductor, exists even on a displacement current as evidenced 
by the existence of a Lorentz force on a moving electric charge and is known to 
arise from the superposition of the magnetic field produced by the moving 
electric charges on the static magnetic field H. Therefore, we may also expect 
to observe a corresponding force on a magnetic current $I_m$, when it is situated 
perpendicular to an electric field E, due to the superposition of the electric field 
produced by $I_m$. The direction of $F_m$ is perpendicular to both E and $I_m$ and 
is as shown in fig. 2b. 

Fig. 1. Relative directions of $I_m$ and E compared with those of $I_e$ and H: (a) $I_e$ and H; 
(b) $I_m$ and E. (In the figures ⊙ indicates a vector directed towards the reader and ⊗ 
a vector directed away from the reader.) 




One of the several methods of calculating the magnitude of $F_m$ is by assuming 
the electrostatic field E to be produced by two plane parallel, infinitely extending 
charge distributions of opposite signs and of uniform surface densities, 
$+E/2\pi$ and $-E/2\pi$, on either side of $I_m$ as in fig. 2b. The effect of E on $I_m$ 
is then the same as that of the charge distributions on $I_m$. Now, since $I_m$ 
produces a finite electric field at every point on the charge distributions, the 
interaction gives rise to forces which may be evaluated by integrating the effect 
over the entire areas of the charge distributions. The resultant force is zero 
in a direction parallel to $I_m$ but has finite components along E and in a direction 
normal to both E and $I_m$. The first gives rise to couples on the positive and 
negative charge distributions while the second results in a force on the entire 
distribution giving rise to field E. But, since the charge distribution is static 
the force would appear as a reaction on $I_m$ given by $F_m = EI_ml$. 
$F_m$ is in dynes when E is in electrostatic units (statvolts per cm), $I_m$ is in 
electromagnetic units and l is in centimetres. 

Fig. 2. Relative directions of $I_m$, E and $F_m$ compared with those of $I_e$, H and $F_e$: 
(a) $I_e$, H and $F_e$; (b) $I_m$, E and $F_m$. 




As $I_m$ can only be produced momentarily by a single magnetization of a bar 
of magnetic material, the force $F_m$ is also momentary. An approximate 
calculation shows that under the conditions obtainable in the laboratory, its 
detection would involve a ballistic measurement of action ($F_m\,dt$), of the order 
of 10~® dyne/sec, which is too small to be detected in this way. Also, the 
stray magnetostatic forces called into play are likely to be much larger than the 
force under investigation. However, the force could perhaps be detected and 
measured by a method in which the magnetization is carried out at a high 
frequency and the action during every half cycle of magnetization is integrated 
by suitably reversing the electric field synchronously at the same frequency to 
produce a large mechanical force in any one direction. 

The author wishes to acknowledge the benefit of a discussion he had with 
Dr. D. Shoenberg, F.R.S., Royal Society Mond Laboratory, Cambridge, on 
this subject when he visited this laboratory last year. 


REFERENCES 


DRUDE, P., 1900, Lehrbuch der Optik (Leipzig). 
SCHELKUNOFF, S. A., 1947, Electromagnetic Waves (New York: Van Nostrand Co.), p. 66. 
WILSON, W., 1950, A Hundred Years of Physics (London: Gerald Duckworth & 
Co. Ltd.), p. 143. 


The Author: 
K. G. Ramanathan is the head of the low temperature division of the National Physical 
Laboratory of India, New Delhi. He is interested in the basic theories of Thermodynamics 
and Electromagnetism. 


The NPL “Microwave Mekometer” 


DISTANCE MEASUREMENT BY A POLARIZATION-MODULATED LIGHT-BEAM. 
For some time distances of 10 km or more, such as are met with in geodetic 
surveying, have been measured by a method based on the velocity of light. 
The amplitude of the light beam has been modulated at a frequency of the order 
of $10^8$ cycles/sec, setting up a modulation wavelength of about 3 m, and the 
distance is then measured in terms of the number of modulation wavelengths 
in it. 


Fig. 1. General plan of the Mekometer. (Block diagram of the N.P.L. microwave 
mekometer; the labelled blocks include a high voltage pulse generator giving 50 pulses 
per second, ‘ringing’ circuits, detectors and difference amplifier, and photoelectric 
detectors.) 


The NPL microwave mekometer is a prototype of a new development 
designed for the measurement of distances greater than 10 m to an accuracy of 
a few thousandths of a centimetre; it makes use of a light beam subjected to 
elliptical-polarization modulation at microwave frequency, 9400 Mc/s. 

Towards the end of the last century it was discovered by J. Kerr that when a 
dielectric is in an electric field it displays the doubly refracting properties of a 
uniaxal crystal whose optical axis is parallel to the field, and so when plane 
polarized light is incident upon it perpendicular to the field it emerges elliptically 
polarized. In the mekometer, plane polarized light is caused to fall on a crystal 
of ammonium dihydrogen phosphate in a microwave cavity resonator. Since the 
relative retardation, per unit thickness of the crystal, between the components of 
the elliptical vibration is proportional to the square of the field strength it follows 
that the optical effect remains the same when the field direction is reversed, i.e. 
the frequency of the modulation is twice that of the microwave, and so the 
wave-length of the modulation is known. 

Fig. 2. Details of the optical system. (Components labelled in the diagram: xenon flash, 
pin hole, polarizer, stop, pulsed X-band magnetron (9400 Mc/s), tuning probe, microwave 
cavity resonator, ammonium dihydrogen phosphate crystal, analyser, semi-reflectors, 
front-silvered reflectors, variable light path, distant reflector (Porro prism), telescope, 
micrometer and photoelectric detector.) 

A reflector placed at one end of the distance to be measured is used to 
return the elliptically polarized light back through the crystal, whereupon the 
ellipticity is increased or eliminated depending upon the phase of the returned 
modulation relative to that of the microwave in the crystal. By setting with the 
micrometer shown, the ellipticity may be eliminated in which case the final 
analyser will quench the light which is now plane polarized and this setting 
establishes a whole number of modulation wavelengths in the path (fig. 2). 
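
The scale of the measurement can be illustrated by a short calculation. The sketch below is not a description of the instrument's own procedure: it simply takes the modulation frequency as twice the microwave frequency, as stated above, and uses the vacuum velocity of light (an assumption, since the refractive index of the air is ignored). The function name and its parameters are hypothetical.

```python
C_VACUUM = 299_792_458.0  # velocity of light in vacuum, m/s (air index ignored)

def modulation_wavelength(drive_hz, frequency_doubled=True):
    """Wavelength of the modulation for a given driving frequency.

    frequency_doubled=True models a retardation proportional to the square
    of the field, so that the modulation runs at twice the drive frequency.
    """
    f_mod = 2.0 * drive_hz if frequency_doubled else drive_hz
    return C_VACUUM / f_mod

# Mekometer: 9400 Mc/s microwave drive, modulation at twice that frequency.
mekometer_wavelength = modulation_wavelength(9400e6)

# Earlier amplitude-modulated method: modulation of the order of 10**8 cycles/sec.
geodetic_wavelength = modulation_wavelength(1e8, frequency_doubled=False)
```

On these assumptions the modulation wavelength of the mekometer is about 1.6 cm, against about 3 m for the earlier method, which is why a setting to a small fraction of a modulation wavelength can correspond to a few thousandths of a centimetre.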

The device, which has been developed by Dr. K. D. Froome of the 
Standards Division, is not sensitive to background illumination and should 
work satisfactorily in full sunshine. V. T. SAUNDERS. 


The Teaching of Science and Mathematics 
The Physics Project under the Nuffield Foundation £250,000 Programme 


Sir David Eccles, Minister of Education, announced in Parliament on 4th April that 
the Nuffield Foundation had offered £250,000 towards the cost of a comprehensive 
long-term programme to improve the teaching of science and mathematics in schools. 

Though not responsible for the initiative in starting this programme, the Minister 
appears well-disposed towards it. 

The physics project is under the leadership of Mr. D. McGill, who has been seconded 
for the purpose by the Scottish Education Department. The Chairman of the Physics 
Consultative Committee will be Professor Sir Nevill Mott, who is Chairman of the 
National Committee for Physics Education set up by the Institute of Physics and the 
Physical Society. 

The range of aids in physics will include teachers’ guides, class texts, laboratory notes, 
apparatus and visual aids such as films, film strips and wall charts, providing co-ordinated 
sets of material which can be used by individual teachers in any way they wish. Such 
material will be welcomed by experienced teachers, while to the inexperienced teacher 
(or the one whose main interest lies outside the subject, but who is asked to teach it because 
of the present shortage of science specialists) its usefulness should be even greater. 

The physics project should be completed within three years. In the first phase, the teams 
will concentrate on producing preliminary drafts of the required teachers’ guides, class 
texts, and laboratory notes, and on commissioning the necessary demonstration and 
laboratory apparatus and the relevant battery of visual aids for the course as a whole. 
The Associated Electrical Industries Film Unit and Design Unit will assist in developing 
the visual aids. The Development Group in the Architects and Building Branch of the 
Ministry of Education are already investigating the design of science laboratories and it 
is expected that the Development Group of the new Science Equipment Consortium of 
local education authorities will collaborate in designing the apparatus. 

The second phase will assess the texts and other material in class, and submit them 
to further detailed criticism by physicists of the first rank and experienced teachers of the 
subject. In the final phase, arrangements will be made for the amended texts and the 
approved versions of the apparatus and aids to be produced on a scale sufficient to meet 
the expected demand. 


DSIR Research Establishments 


THE BUILDING RESEARCH STATION, GARSTON 


On Friday, 7th July, 1961, Sir Harry Melville, F.R.S., Secretary of DSIR, 
opened a new laboratory for research on materials at the Building Research 
Station, Garston. The Exhibition to mark the occasion featured mainly work on 
building materials; there were also some interesting exhibits dealing with the 
physics of the environment, heating, lighting and acoustics. 


1. THE HIGH TEMPERATURE MICROSCOPE 


This is an instrument for observing, and measuring the temperatures of, 
recrystallization, melting and polymorphic inversion in ceramics, slags and 
refractory materials up to 2300°C. 

The material to be examined is held at the tip of a thermocouple which is 
heated electrically to fuse the sample, and which also serves to indicate the 
temperature at which any observed change occurs in the melt. The heated 
assembly is contained in a cell designed for mounting on the stage of a polarizing 
microscope. 

A high-speed 50-cycle polarized relay connects the thermocouple in turn to 
the power supply B and to the measuring circuit A (fig. 1). The heating 
current flows only during alternate half-cycles, and the measuring circuit is 
connected during the intermediate half-cycles; provided that the changeover 
contacts of the relay operate on the break-before-make principle, the temperature 
measuring circuit will be isolated from the power supply. The heated portion 
of the thermocouple is a 5 per cent Rh-Pt/20 per cent Rh-Pt couple. 


Fig. 1. To illustrate the independence of the heating and the temperature measuring 
circuits; A is the meter system, B a source of variable low voltage 50-cycle power, and 
C a polarized relay. 

The apparatus permits continuous observation of the phases in the melt and 
the provision for the use of polarized light enables the observer to distinguish 
between isotropic and anisotropic phases. 




2. WHERE’ER YOU WALK 


In order to be able to assess the resistance of floor materials to wear by foot 
traffic a machine had to be designed which would simulate the abrasive action 
of the foot, both in straight walking and in turning, on the level; and the first 
step, so to speak, was to find out exactly what this action is. 

The forces of walking were determined by a force plate, and these were 
related to measurements of the area of contact between the foot and the floor, 
recorded cinephotographically. 

The force plate unit (fig. 2) consists of a steel plate supported on four 
hollow columns of aluminium alloy which are fixed to the lower plate, and which 
carry paper resistance strain gauges. 


Fig. 2. The force-plate, removed to show the four recording columns and the zero-drift 
compensator. 


The fifth column, which is slightly shorter to keep it clear of the top plate, is a 
dummy carrying paper gauges to eliminate drift of zero balance in the circuit of 
the vertical component. The gauges are connected in six Wheatstone networks 
feeding to recorders (see fig. 3) through amplifiers. A footstep on the upper plate 
compresses and bends the columns. The conclusions were incorporated in an 
abrasion machine. This should exert on a sample a vertical load of the order of 
250 lb acting on a shoe to provide a pressure of 20-25 lb/sq. in. At the same 
time it should provide a horizontal force on the shoe of 50 lb. The abrasive 
action should be produced by a rotation of this shoe through an angle not 
greater than 65° at a rate not greater than 7 r.p.m. The maximum torsional 
couple that should be provided should exceed 130 in. lb. In addition a 
subsidiary impact tool, simulating the high point loads produced by the heel, 
is necessary. This tool should apply loads of about 400 lb/sq. in. on an area of 
+ sq. in. or less. 


Such a machine was shown in operation. 
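
The design figures quoted for the abrasion machine imply a definite shoe area, since pressure is simply load divided by area. The short sketch below works through that arithmetic; the function name is hypothetical and the calculation is offered only as a consistency check on the quoted figures, not as part of the Station's procedure.

```python
def contact_area_sq_in(load_lb: float, pressure_lb_per_sq_in: float) -> float:
    """Area (sq. in.) over which a load must act to give the stated pressure."""
    if pressure_lb_per_sq_in <= 0:
        raise ValueError("pressure must be positive")
    return load_lb / pressure_lb_per_sq_in


# Vertical load of 250 lb at the two ends of the quoted 20-25 lb/sq. in. range.
areas = [contact_area_sq_in(250.0, p) for p in (20.0, 25.0)]
print(areas)
```

On these figures the shoe would bear on roughly 10 to 12.5 sq. in. of floor.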


Fig. 3. Typical series of force records: straight walking, right foot. The panels show the 
vertical component of force (lb), the horizontal component of force (lb), the torque 
(in. lb, clockwise and anticlockwise), the area of contact (sq. in.) and the contact 
pressure (lb/sq. in.), each plotted against time. (From National Building Studies, 
Research Paper 32 (H.M.S.O.).) 


3. THE PERMANENT SUPPLEMENTARY ARTIFICIAL LIGHTING OF INTERIORS (PSALI) 


During the past hundred years there have been continuous demands for more 
and better quality lighting in buildings, and nowadays if 50 lumens/sq. ft can 
be got in a modern office by switching on the light, no one is prepared to work 
in poor and inadequate daylight. In particular in a deep room or in a room with 
a low ceiling there is often a wide range of daylight illumination. 

To provide uniform illumination in such rooms it has been found necessary 
to devise a system (PSALI) in which daylight provides a dominant source of 
lighting and artificial lighting adds illumination to the less well-lit parts of the 
room. 

The exhibit consisted of a model room which, with daylight lighting only, 
showed a wide range of illumination; in it there was installed a supplementary 
artificial system whose intensity could be varied by the observer. The observer 
was invited to vary the artificial lighting until he considered that the room was 
uniformly well-lit. It was noticeable that there was close agreement between 
the settings arrived at by various observers. 
The investigations led to the conclusion that the average level of supplementary 
illumination needed is given (in lumens/sq. ft) by 

$$E = \frac{D\,L}{10},$$

where D (per cent) is the daylight factor (minimum) and L (foot-lamberts) is 
the sky brightness as seen from the viewing point in the room. For a conventional 
side-lit room giving a daylight factor of 1 per cent and a mean sky brightness (as 
seen through the window from the remoter parts of the room) of 500 foot-lamberts, 
the level of PSALI would be approximately 50 lumens/sq. ft. 
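
The worked figures in the text (a daylight factor of 1 per cent, a sky brightness of 500 foot-lamberts and a result of about 50 lumens/sq. ft) are consistent with reading the relation as the product of D and L divided by 10. A minimal sketch under that reading follows; the function name is hypothetical.

```python
def psali_level(daylight_factor_percent: float, sky_brightness_ft_lamberts: float) -> float:
    """Supplementary illumination (lumens/sq. ft), assuming E = D * L / 10."""
    if daylight_factor_percent < 0 or sky_brightness_ft_lamberts < 0:
        raise ValueError("inputs must be non-negative")
    return daylight_factor_percent * sky_brightness_ft_lamberts / 10.0


# The side-lit room quoted in the text: D = 1 per cent, L = 500 foot-lamberts.
example_level = psali_level(1.0, 500.0)
print(example_level)
```

On this reading, a brighter sky or a better daylit room calls for more, not less, supplementary light, because the artificial light has to balance what the eye sees through the window.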

Shown in connection with this exhibit was a daylight factor meter with a 
louvred mask so that it could be used by holding it out of a window and with a 
depolished Perspex capsule to correct the cosine response of the selenium cell. 


4. OTHER EXHIBITS 


I also saw but can only note briefly 

(i) an electron microscope used for the study of set cements, 

(ii) the procedure for investigating the adhesion of plaster to cement— 
techniques have been developed for measuring the strength of the 
mutual bond and the amount of movement required to cause failure of 
the adhesion, 

(iii) two methods of determining the size of pores and the distribution of 
sizes, in building materials—one method based on forcing mercury into 
the pores and the other on forcing a gas through a section of a brick, 

(iv) an examination of the increased corrosion caused by applied stress in 
alloys used for prestressing concrete—there were specimens showing 
the stress-corrosion cracks observed in hot nitrate solutions, 

(v) measurements of the thermal conductivity of samples and also of a 
whole wall under conditions of natural exposure, 

(vi) a small mobile laboratory in use measuring automatically round the 
clock the noise levels at 500 sites in London—a task undertaken in 
collaboration with the L.C.C. 


REFERENCES 


WELCH, J. H., 1954, J. Sci. Instrum., 31, 458-462. 
National Building Studies, Research Paper 32 (H.M.S.O.). 
Building Research Station Digest, June 1960, 135. 


V. T. SAUNDERS. 


The British Association Granada Lectures 1961 


The three 1961 Granada Lectures, on the theme ‘Communication in the Modern 
World’, were: 


‘The Language of Animals’, by Sir James Gray, 
‘Why Scientists Talk’, by Professor Hermann Bondi, and 
‘The Gap – and the Bridge’, by Sir John Wolfenden. 


Sir James Gray pointed out that language may be emotional, announcing fear, pain, 
matrimonial intentions; or it may be descriptive. The former is innate, the latter is learnt. 
Descriptive language requires a department in the brain for encoding the message, a trans- 
mitter, a receiver and a decoding centre in the brain. With José the chimpanzee on the 
platform, Sir James reminded us that she has, as we have, vocal cords, teeth, lips, tongue 
and lungs with which to blow air through them, in fact all the components of a transmitter ; 
she is also equipped with a receiver comparable with ours; but José has no encoding and 
decoding centres and so, even though she is an excellent mimic of human behaviour—she 
unlocked for us a trunk which had a complex system of locks and latches by imitating her 
keeper—she is quite incapable of talking. 

Dogs and seals can understand human speech to the extent of learning to respond in an 
orderly (and reward-seeking) way. Bees appear to have a facility which they alone share 
with man, a truly descriptive language. “Clover honey is located, in considerable quantities, 
at a distance of approximately 800 yards on a bearing 45° west of the sun.” A bee can actually 
give this message to other bees in its hive. It does so by means of a dance performed in 
total darkness, which the recipients of the message detect with their antennae; this is a case 
of the communication of something more than simple emotions—precise information 
appears to be intentionally communicated. It is difficult to imagine how the elaborate 
totalitarian society of the bees could exist unless the individuals had such powers of 
communication. 

Parrots are popularly thought of as talking but this is mimicry, not intelligent speech. 
The various patterns of sound produced by birds have been subjected to audio-spectrum 
analysis but in no case is there evidence of descriptive language. 

Sir James gave the warning that even though evidence may be found suggesting the 
communication of precise information nevertheless the transference may be accidental or 
coincidental and not intentional. 


Professor Bondi mentioned first the nature of ‘proof’ in science. If at any stage a 
forecast based on a scientific theory disagrees with experiment, the theory is thereby disproved; 
but however many cases of agreement of forecast with observation there may have been, it 
cannot be held that the theory has been proved. Theories are communicated to others for 
checking and testing and “it is in this readiness to cooperate, to listen to criticism and to 
agree that one’s point of view may be wrong, that the distinguishing feature of scientific 
work lies. In science, to be mistaken is not shameful; on the contrary, it is the human enter- 
prise that more than any other reckons with human fallibility.” 

The two principal ways in which scientists communicate ideas and discoveries are by the 
publication of papers and by talking. Published papers provide a permanent record for 
future research and teaching but publication is slow, and the printed word is so impersonal 
that it can give rise to the idea that a scientist is a cold, insensitive calculating machine. 
Talking is the personal, tentative, exploratory procedure; scientists travel all over the world 
to make personal contacts so that they may discuss their raw unformed ideas and the lines 
of attack on a problem. 

Referring to undergraduate education the lecturer reminded us that the medical student 
is, in fact, receiving a vocational training whereas the arts student is enjoying what might be 
called education. Somewhere between these two extremes there must be a balanced intel- 
lectual discipline. The science student appears to be at a disadvantage in comparison with 
the arts student in that the latter can, at a very early stage of his university career, make useful 
original contributions in discussions on matters which lie, still, in the field of uncertainty; 
whereas the former is unlikely to be able to do so until he has spent some years acquiring 
a background and a technique. What can be done to enable the science student to 
get through, in a shorter time than is at present possible, to the edge of the unknown ? 
Should the student’s course be directed along corridors rather than as at present over a 
wide front ? 

Should the art of communicating ideas be formally taught to students ? Professor Bondi 
suggested that at a time when the students are mainly concerned with organizing the 
inflow of ideas they are not receptive to instruction concerning the outflow. 


Sir John Wolfenden said that most of us, with the exception of painters, musicians, 
sculptors, communicate by using words; but history and usage have given all words under- 
tones and overtones of meaning, which can give rise to misunderstandings between speaker 
and hearer. 

The scientist tries to guard against this by inventing austere forms of expression. 
The mathematician has invented a range of signs, symbols and brackets which are 
unequivocal, a range which in its own field and for its own purposes is ideal. At the other 
extreme there is ‘the ordinary slovenly messiness of everyday conversation’. In between 
there is a whole spectrum of attempts at self-expression. 

In answer to the rhetorical question ‘ Where do we stand?’ Sir John said that that 
depended on whether our education had been based on grammar and literature and words 
or upon the expression of scientific concepts. 

Scientific communication operates at three levels. First, that of shared expertness, as 
when mathematicians confer; here there is a chance of perfect mutual intelligibility. The 
same is possible but less likely at a conference on one of the descriptive sciences. Neverthe- 
less mutual intelligibility is becoming harder as knowledge becomes more specialised and 
fragmented. At the second level the expert is addressing those who have a background 
of previous relevant information; here the speaker ‘must be prepared to compromise with 
exactitude for the sake of being intelligible’ and the listener must offer ‘a receiving surface 
which will not distort that which is received’. Then there is the third level, that at which 
the expert is talking to the ordinary man; and here the exposition will be less exact and the 
mutual intelligibility markedly lower than at the other levels; here is the ‘Gap’ across 
which the scientist on the one side and the ordinary man on the other must try to throw a 
bridge. 

At this third stage, there are urgent calls for a radical reform of our whole educational 
outlook and practice, for if scientists are illiterate and non-scientists are innumerate, the 
former must be taught letters and the latter must be taught numbers. This seems a 
doubtful solution. University syllabuses are quite full already and any attempt at adding to 
any of them must inevitably be accompanied by a subtracting; and if that is done to our 
science courses we may, by reducing the degree of specialisation, be making it more difficult 
for our experts to hold their own in this highly competitive world. 

In conclusion Sir John suggested that it may be well to accept the expertise of the 
scientist, the diplomat, the administrator, the artist, each in his own field; to avoid letting 
the fear of the disintegration of our Universities into isolated faculties determine our 
educational policy; to be concerned about but not overwhelmed by the possibility of there 
growing up two opposing and mutually antagonistic sets of values; and, on the other hand, 
to recognise how much men hold in common and to be encouraged by the fact that scores 
of talented expositors, of the highest standard in their own subjects, have not thought it 
beneath their intellectual dignity to write for the intelligent layman. And not only have 
they done so, but the intelligent layman, by hundreds of thousands, reads what they write 
and listens to what they say. There is a gap. But we are not without the means of 
bridging it. 

V. T. SAUNDERS. 


Masses of Nuclei 


by E. B. PAUL 
Manchester University 


The study of masses of the different nuclear species is basic to the whole 
of nuclear physics. Because of the interchangeability of mass and energy the 
details of energy absorption or release are identical with mass differences or 
changes in the masses of participating nuclei. The systematic trends in mass 
value reflect therefore changes in the nuclear binding energy. It is one of the 
primary tasks of nuclear theory to explain the total binding energy of a lump of 
nuclear matter. The systematic trends in mass differences are also of basic 
importance in evolving models to describe the formation of the elements in our 
galaxy. 

For many years measurements of this fundamental parameter have been in a 
most unsatisfactory state. One primary method for the measurement of nuclear 
masses is the comparison of mass doublets in mass spectrometers of high resolution. 
The resolution must indeed be very high since to measure the mass of a 
nucleus of A = 100 to an accuracy of 1 kev requires a precision of one part in 10⁸. 
Thus the mass spectroscopists had some difficulty in building instruments of 
high enough resolution. On the other hand by the mass spectrometric method 
the masses of essentially all the stable nuclei could be evaluated. The second 
primary method for mass determination is the measurement of total β-decay 
energies. This measures the difference in mass between the radioactive nucleus 
and the daughter nucleus. It is relatively easy to do this with an accuracy of 
1 kev although, since it involves the determination of the end-point of a continuous 
β-spectrum, there are the usual pitfalls connected with estimating the cessation 
of counts. In addition there is the danger that the prominent β-spectrum in 
fact leads to an excited state of the final nucleus and that what is being measured 
is not the difference in mass of the ground states but the difference between the 
ground state of the radioactive nucleus and an excited state of the daughter 
nucleus. Thus the measurements must include careful β-coincidence studies as 
well and the total energy determination may well include a γ-energy measurement 
which may be less accurate than the β-energy measurement. The third 
basic method of mass comparison is by measurement of the energy release in 
nuclear reactions. This involves the absolute energy measurement of one or 
more charged particles and of course sometimes a γ-energy measurement also. 
Again these measurements can be done with rather high accuracy of the order 
of a kev or so in the mass difference. They depend critically however on an 
absolute knowledge of the incident particle energy (that is, the accelerator 
energy) and on a suitable comparison energy for the resultant particle measurement. 
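
The precision quoted for the mass spectrometric method can be checked with a line of arithmetic. The sketch below assumes the standard energy equivalent of one atomic mass unit (about 931.5 Mev, a figure not quoted in the article) and approximates the nuclear mass by its mass number; the names are hypothetical.

```python
# Energy equivalent of one atomic mass unit, in kev (standard value, rounded;
# it is an assumption here because the article does not quote it).
KEV_PER_MASS_UNIT = 931_494.0


def fractional_precision(mass_number: int, error_kev: float) -> float:
    """Fractional mass precision needed to fix a nucleus to within error_kev.

    The nuclear mass is approximated as mass_number mass units, which is
    adequate for an order-of-magnitude estimate.
    """
    if mass_number <= 0:
        raise ValueError("mass number must be positive")
    return error_kev / (mass_number * KEV_PER_MASS_UNIT)


needed = fractional_precision(100, 1.0)
print(f"about 1 part in {1.0 / needed:.1e}")
```

The result is close to one part in 10⁸, in line with the figure given in the text for A = 100.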

However the latter two methods suffer from the drawback that not all stable 
nuclides or pairs of nuclides are easily accessible and elaborate decay and reaction 
cycles had to be devised to link up a large set of masses. Just one inaccurate 
measurement in the cycle would give rise to widespread discrepancies. For 
a long period the collection of mass spectroscopic results and the collection of 
nuclear results were disturbingly discrepant. However, the Proceedings of an 
International Conference on Nuclidic Masses† which was held in Hamilton, 
Canada, in 1960 show very clearly how much this situation has now improved. 

At this conference the mass spectroscopists, the β-spectroscopists and the 
nuclear reaction experts gathered together in an apparently amicable spirit and 
hopefully consistent sets of nuclear masses in which both sorts of data are included 
began to emerge. In fact most of the discussions in the opening sessions were 
concerned with exactly how the great mass of reliable data should be treated so 
as to evolve a self-consistent set of masses. In any least-squares analysis of data 
of differing origin and therefore subject to different systematic errors, some 
weighting must be assigned to each datum. Is it proper to use the quoted error, 
the deviation of the datum from trends of neighbouring data or some other 
more arbitrary criterion to assign weights? The experimental purists would 
avoid rejecting any data. The mass table compiler is almost compelled to 
reject some of the measurements. In some cases he finds that if he multiplies 
a whole body of data from one laboratory by a certain factor it then agrees with 
all the other data. Professor DuMond represented the cautious experimenter 
in warning against rejecting or correcting data merely on the basis of a large 
discrepancy without having a physical reason for doing so. A constant does 
not change with time but our knowledge of the value of the constant does! 
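
The weighting question can be made concrete with the simplest case, several independent measurements of one quantity combined with weights equal to the inverse squares of the quoted errors. The sketch below is illustrative only: the data are hypothetical, and inverse-variance weighting is one conventional choice, not a procedure attributed to the compilers at the conference.

```python
import math


def weighted_mean(values, sigmas):
    """Inverse-variance weighted mean of independent measurements.

    Returns (mean, internal_error, reduced_chi_square).
    """
    if len(values) != len(sigmas) or len(values) < 2:
        raise ValueError("need at least two values with matching errors")
    weights = [1.0 / s ** 2 for s in sigmas]
    total = sum(weights)
    mean = sum(w * v for w, v in zip(weights, values)) / total
    internal_error = math.sqrt(1.0 / total)
    chi_square = sum(w * (v - mean) ** 2 for w, v in zip(weights, values))
    return mean, internal_error, chi_square / (len(values) - 1)


# Hypothetical mass differences (kev) and quoted errors from three laboratories.
values = [100.0, 102.0, 110.0]
sigmas = [1.0, 2.0, 2.0]
mean, err, red_chi2 = weighted_mean(values, sigmas)
print(mean, err, red_chi2)
```

A reduced chi-square far above 1, as in this made-up example, signals that the quoted errors cannot all be right, which is the compiler's dilemma described above. One common convention is then to inflate the combined error by the square root of the reduced chi-square rather than to reject a datum outright.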

The theoretical interpretation to be placed on the tables of masses that are 
now available was the subject of another conference session. For many years 
nuclear physicists have made use of mass formulae such as the Bethe-Weizsäcker 
one to predict masses and fill in gaps in the table. Such a formula includes 
the mass energy of the appropriate number of free nucleons modified by the 
binding energy, a modifying term to correct for neutron or proton excess, a term 
to correct for the coulomb energy and a term to correct for surface effects. As 
more accurate mass tables became available the way in which such a semi- 
empirical mass formula had to be changed and refined to agree with the facts 
often gave valuable information about nuclear properties. This was demonstrated 
very clearly in a paper by Wheeler and his collaborators who showed how 
nuclear deformation could affect the value of the surface energy term. Zeldes 
also demonstrated how starting from the shell model and pairing interaction 
the mass values could be reproduced with striking accuracy. One is led to 
hope that as soon as mass tables can be provided with slightly better accuracy 
the theoretical physicists will be able to provide a convincing explanation of why 
the trends are what they are. This is most important as it enables a reliable mass 
formula to be used to predict binding energies of new or unstable nuclides which 
are difficult to measure directly. 
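
The structure of such a semi-empirical formula can be sketched in a few lines. The coefficients below are illustrative textbook-style values in Mev and are assumptions, since published fits differ and the article quotes none; the pairing term is a common addition beyond the terms listed above, and the function name is hypothetical.

```python
import math

# Illustrative coefficients in Mev (assumed; published fits differ).
A_VOLUME, A_SURFACE, A_COULOMB, A_ASYMMETRY, A_PAIRING = 15.8, 18.3, 0.714, 23.2, 12.0


def binding_energy_mev(mass_number: int, proton_number: int) -> float:
    """Semi-empirical estimate of the total binding energy of a nucleus."""
    a, z = mass_number, proton_number
    n = a - z
    volume = A_VOLUME * a                              # bulk binding
    surface = A_SURFACE * a ** (2.0 / 3.0)             # surface correction
    coulomb = A_COULOMB * z * (z - 1) / a ** (1.0 / 3.0)  # proton repulsion
    asymmetry = A_ASYMMETRY * (n - z) ** 2 / a         # neutron or proton excess
    if z % 2 == 0 and n % 2 == 0:
        pairing = A_PAIRING / math.sqrt(a)             # even-even nuclei
    elif z % 2 == 1 and n % 2 == 1:
        pairing = -A_PAIRING / math.sqrt(a)            # odd-odd nuclei
    else:
        pairing = 0.0
    return volume - surface - coulomb - asymmetry + pairing


fe56 = binding_energy_mev(56, 26)
print(round(fe56 / 56, 2), "Mev per nucleon")
```

With these assumed coefficients the estimate for an iron-like nucleus (A = 56, Z = 26) comes out between 8.5 and 9 Mev per nucleon. Refinements of the kind described above, such as a deformation-dependent surface term, would enter as changes to the individual terms.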

The conference then turned to the experimental problems connected with each 
of the methods of mass determination which have been mentioned. The 
measurement of energy release (Q-value) in a nuclear reaction involves some 
absolute calibration of the energy of the particle initiating the reaction and of the 
energies of the resultant particles. The provision of energy standards for accelerators 
and for comparison in particle spectrometers has been a vexed question for a 
long time. On the former problem of accelerator calibration Marion showed 
that an accuracy of 0.1% is now reasonable provided sufficient care is taken by 
the experimenter in the design of his experiment. The measurement of resultant 
particle energy has been made largely in magnetic spectrographs which are 
calibrated relative to certain natural α-particle energies, notably ²¹⁰Po. Unfortunately 
a set of excellent modern measurements of this energy differs from a 
set of equally excellent older measurements by about 0.1%. The older measurements 
are relative to an absolute RaC′ α-energy measurement while the later 
measurements are absolute. The RaC′ absolute measurements are those of 
Briggs made in 1936 having an accuracy of about 0.01%. Absolute ²¹⁰Po 
measurements are about 6 kev higher than those based on Briggs’ measurement. 
Another important measurement was reported by Browne in which he intercompared 
the accelerator calibration characterized by the ⁷Li(p,n)⁷Be threshold 
energy and the ²¹⁰Po α-energy. His result would lead to a still higher value for 
the ²¹⁰Po if we assume the lithium threshold is correct. This would tend to 
confirm that there must be an error in the Briggs’ work but no one is yet in a 
position to say why it is wrong. Asaro proposed that because of some of the 
difficulties in using ²¹⁰Po as a standard, another α-emitter, ²²⁴Ra (ThX), be adopted 
as a standard. 


† Proceedings of the International Conference on Nuclidic Masses, 1960, ed. H. E. 
Duckworth (University of Toronto Press). 
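
A shift of about 6 kev in the polonium standard can be expressed as a fraction of the α-particle energy itself. The sketch below assumes an α-energy of roughly 5.3 Mev for polonium-210, a figure taken from standard tables and not quoted in the article; the names are hypothetical.

```python
# Approximate polonium-210 alpha-particle energy in kev. This value is an
# assumption from standard tables; the article does not quote it.
PO210_ALPHA_KEV = 5304.0


def fractional_shift_percent(shift_kev: float, energy_kev: float) -> float:
    """Express an energy shift as a percentage of the reference energy."""
    if energy_kev <= 0:
        raise ValueError("reference energy must be positive")
    return 100.0 * shift_kev / energy_kev


shift = fractional_shift_percent(6.0, PO210_ALPHA_KEV)
print(f"{shift:.2f} per cent")
```

The shift comes to a little over a tenth of a per cent, roughly ten times the accuracy claimed for the 1936 measurement, which is why the disagreement matters for every Q-value calibrated against this standard.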

In the past there has been the difficulty that nuclear physicists who measured 
reaction Q-values did so not with the end in view of mass determination but 
rather to elucidate some other aspect of nuclear structure. The mass measurement 
was a by-product. Buechner and Mazari described a recent programme 
at the University of Mexico in which Q-value measurements were made on the 
nuclides where the gravest discrepancies with mass spectroscopic values showed 
up. The result of this however was that the discrepancy (about 0.1%) remains. 
If the nuclear spectroscopists adopted the lower value of the ²¹⁰Po energy which 
is now thought to be wrong, they could get agreement with the mass spectroscopic 
work. Neutron capture γ-ray energies have been another fruitful source of mass 
values especially in the heavy nuclei. Two major groups (Kinsey and Bartholomew 
in Canada and Groschev in Russia) have made many neutron binding 
energy determinations and other groups are now entering the field. 

β-ray end points and γ-ray energy determination have been steadily improving 
in accuracy and refinement. Lidofsky considered several subtle corrections 
which can affect the shape of the β-spectrum very near the end point and so 
introduce errors in estimating the end point by extrapolation of a Kurie plot. 
Graham described the elegant and powerful new β-spectrometers which are now 
coming into use in many parts of the world. Resolutions approaching 0.01% 
are now becoming almost commonplace. In the field of precision γ-ray energy 
determinations Knowles described the progress that has been made since the 
pioneer work of DuMond using a curved crystal diffraction spectrometer. 
Knowles himself has developed a double flat crystal spectrometer which, because 
the crystal does not have to be bent, can use much thicker crystals and hence work 
to higher energies. With this he has made a redetermination of the deuteron 
binding energy, a constant our knowledge of which has in the past changed 
markedly with time. Knowles’ value is 2227.0 ± 0.5 kev while the previous ‘best’ 
value was 2224.7 ± 0.8 kev. The inconsistency of the errors quoted is very 
striking. Yet in 1947 the ‘best’ value was 2187 ± 11 kev. 
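
How striking the inconsistency is can be put in numbers by dividing the difference between two values by their errors combined in quadrature. The sketch below uses only the three deuteron binding energies quoted above; treating the errors as independent standard deviations is an assumption, and the function name is hypothetical.

```python
import math


def discrepancy_in_sigma(value_a, sigma_a, value_b, sigma_b):
    """Difference between two results in units of their combined error."""
    combined = math.hypot(sigma_a, sigma_b)  # errors added in quadrature
    return abs(value_a - value_b) / combined


# Knowles' value against the previous 'best' value, and against the 1947 value (kev).
knowles_vs_previous = discrepancy_in_sigma(2227.0, 0.5, 2224.7, 0.8)
knowles_vs_1947 = discrepancy_in_sigma(2227.0, 0.5, 2187.0, 11.0)
print(round(knowles_vs_previous, 2), round(knowles_vs_1947, 2))
```

On this reckoning the new value lies about two and a half combined standard deviations from its predecessor, and more than three and a half from the 1947 value despite the latter's much larger quoted error.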

Modern mass spectrometers still basically follow the design of Aston in 1922 
in that they involve successive electrostatic and magnetic analysers. However 
because of the requirements for high precision they have become very large and 
complex indeed. In addition several completely new kinds of mass spectrometer 
have been constructed. One of the most interesting is that which depends on a 
measurement of the cyclotron frequency of the ion. Although hitherto this type 
has not achieved the resolution of the conventional spectrometers Smith described 
a new version of this type which promises to achieve the highest resolution yet 
attained, perhaps 1 part in 10°. Essentially the number of parameters which 
must be held constant seem fewer than in the conventional spectrometer and the 
measurement of frequency can be carried out with fantastic accuracy. In the 
conventional mass spectrometer the problem of maintaining the acceleration 
energy, the electric field and the magnetic field sufficiently constant after many 
hours of operation is a formidable one and involves elaborate electronic circuits. 
Much work has also been done on the ion source which must operate with a great 
variety of input materials and which must introduce the ions into the analyser 
with appropriately controlled angular and velocity spread. 
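
The attraction of the cyclotron-frequency method is that, for ions of equal charge in the same magnetic field, the ratio of two masses is simply the inverse ratio of two frequencies. The sketch below illustrates the underlying relation f = qB/(2πm) with standard values of the physical constants; it is not a description of Smith's instrument, and the function names are hypothetical.

```python
import math

ELEMENTARY_CHARGE_C = 1.602176634e-19   # coulombs
ATOMIC_MASS_UNIT_KG = 1.66053906660e-27  # kilograms


def cyclotron_frequency_hz(mass_u: float, charge_state: int, field_tesla: float) -> float:
    """Orbital frequency of an ion in a uniform magnetic field (non-relativistic)."""
    mass_kg = mass_u * ATOMIC_MASS_UNIT_KG
    return charge_state * ELEMENTARY_CHARGE_C * field_tesla / (2.0 * math.pi * mass_kg)


def mass_ratio_from_frequencies(freq_1_hz: float, freq_2_hz: float) -> float:
    """m1/m2 for two ions of equal charge in the same field: equals f2/f1."""
    return freq_2_hz / freq_1_hz


# A proton in a 1 tesla field, and a doublet-style comparison of two ions.
proton_freq = cyclotron_frequency_hz(1.00727647, 1, 1.0)
ratio = mass_ratio_from_frequencies(
    cyclotron_frequency_hz(28.0, 1, 1.0), cyclotron_frequency_hz(14.0, 1, 1.0)
)
print(proton_freq, ratio)
```

Because the field cancels in the ratio, only its constancy between the two frequency measurements matters, which is consistent with the remark above that fewer parameters need to be held constant.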

The result of all this endeavour is that mass doublets can now be measured 
with accuracies which are somewhat better than nuclear reaction results. This 
is a situation which must surprise the nuclear reaction experts and should put 
them on their mettle. In fact this typifies what is happening in several fields of 
nuclear physics when the requirement now is for high precision. Workers in 
other branches of physics have known for a long time that precision measure- 
ments are very hard indeed and lead one to elaborate refinements of experimental 
technique and often to complex and expensive equipment. While nuclear 
physics was a new science almost any fact which a researcher established was 
valuable even if a numerical value were only known to an order of magnitude or 
perhaps to 20%. Now however, in neutron data for reactor design, values of 
nuclear cross sections to a fraction of a percent accuracy are needed. The 
position of energy levels of rotational nuclei can be predicted with very high 
accuracy by the Bohr-Mottelson theory and in order to study refinements to the 
theory energy level positions are required to be measured to a kilovolt or so. 
As we have seen Q-value measurements and β-decay energies are also needed 
to 0.1% or so in order to show up details of the mass curve. 

These requirements are certain to change the character of experimental 
nuclear physics and many physicists who are used to the good old days of a cross 
section measurement from a three hour run on an accelerator would deplore it. 
It is clear that much of the future work will involve perhaps several years devoted 
to a single measurement. Perhaps one can also say that low energy nuclear 
physics has come of age. 


The Author: 


Professor Eric B. Paul is in charge of experimental nuclear physics at Manchester. 
He came to Manchester in 1961, after five years at Harwell and eight years at Chalk River 
Laboratories in Canada. 


ESSAY REVIEWS 


High Energy Instrumentation 


Proceedings of an International Conference on Instrumentation for High-Energy Physics. 
Lawrence Radiation Laboratory, University of California, Berkeley. (New York 
and London: Interscience Publishers Inc., 1961.) [Pp. xvii + 320.] $10. 


During the last two decades high-energy nuclear physics has grown into one of the 
largest, and technically the most complicated, branches of physics. In the early days experi- 
mentalists requiring energies greater than a few Mev depended on the weak cosmic-ray flux 
of protons and neutrons for irradiating their apparatus. With the artificial production of pions 
in 1948, using α-particles from the 184-in synchro-cyclotron at Berkeley, much more complex 
experiments became possible. Inevitably the instrumentation became more sophisticated. 
Later the successful operation in various countries of several proton synchrotrons, producing 
particles from 1 Gev up to 30 Gev, has led to an even greater extension of instrumentation 
for experiments on the properties of strange particles and anti-nucleons. It was thus very 
timely that an International Conference entirely devoted to Instrumentation in this field of 
study should have been held at Berkeley in September 1960. 

The Conference was divided into six sessions, two of which were divided into parallel 
sessions. No discussions on the theory and construction of accelerators were included and 
only one session was devoted to beam transport and closely related topics. Several papers on 
beam-handling equipment were included in a miscellaneous session at the end of the Conference. 
The main emphasis was on particle detectors of several kinds with a brief exposition of modern 
methods of data reduction. 

The experimental techniques in common use in high-energy physics may be divided into 
two broad categories, namely visual detectors and counters of various kinds. Among the 
visual detectors are the photographic emulsion, the Wilson chamber, the bubble chamber, the 
luminescent and fibre chambers, and spark chambers. Counters include gas-filled counters, 
scintillation counters, Cerenkov counters of many kinds and solid-state detectors. Almost all 
of the instruments were mentioned in the Conference but naturally only a few were discussed 
at length. 

Photographic emulsions are still of great value, particularly with the very high-energy 
machines such as the CERN proton synchrotron and the Brookhaven A.G.S., but the tech- 
nique was not discussed at the Conference. The Wilson chamber is nearly obsolescent but 
bubble chambers are still widely used and many new ones are under construction. The 
session devoted to the chambers produced some useful papers both on the construction of the 
chambers and on the quality of the measurements that can sometimes be made on the track 
images. Several papers on ionization measurements in propane and hydrogen chambers are 
of considerable interest. A good deal of time was devoted, as always at sessions of this kind, 
to the problem of how big to make future chambers. No significant new ideas emerged from 
the discussions. 

Progress with luminescent and fibre chambers is disappointing, and consequently few papers 
on these devices were presented. In fact no successful experiment using these techniques, 
which looked so attractive a year or two ago, has yet been completed. 

Spark chambers are new and were discussed at some length during the Conference. 
Essentially the spark chamber consists of a series of parallel plates in a gas-filled enclosure. 
When a conventional counter set indicates that a wanted particle has traversed the chamber, 
intense electric fields are established between the plates. Breakdown occurs where the 
charged particles passed through. The sparks which are produced can be photographed and 
produce a ‘track’ in the chamber. Several papers were presented which indicate that recent 
improvements to the spark chamber technique, which is basically quite old, make it poten- 
tially of great value in high-energy physics. Numerous spark counters are now being 
made, several of which are very large. For example, a large spark chamber containing 10 
tons of material in the plates is being built at Brookhaven in the hope that it may detect high- 
energy neutrino interactions. It seems likely that spark counters in various forms will prove 
to be of great value mainly because they have to be triggered by a counter system and thus 
can work in very intense beams. Unfortunately the expansion of the bubble chamber can- 
not be counter controlled and, excluding the possibility of pencil beams, the bubble 
chamber only works well in beams of modest intensity, say 20 particles per pulse. 


Some new Cerenkov counters, both liquid and gas-filled, were described at the Conference. 
Several papers were devoted to new solid-state circuits for pulse-height analysers and other 
purposes, such as tunnel diode high-speed circuits. Brief mention was made of the possibility 
of using solid-state detectors in high-energy physics. 

Finally, a session was devoted to bubble chamber data reduction problems. Several 
review papers on recent developments are of great interest. Unfortunately these problems 
are very complex and are difficult to explain in short papers. Brief mention is made of the 
growing problem of data reduction following counter experiments. 

This Conference Report is well edited and produced, except for the half-tone blocks, which 
are hardly of adequate quality. ‘The volume provides the reader who did not attend the 
Conference with an excellent survey of a very vast field of work. Many of the papers are 
rather short. In fact it is a failing of experimental physicists that they often refrain from 
reporting adequately on the instruments they invent and the techniques they develop. Regular 
International Conferences on Instrumentation will help to disseminate information, particu- 
larly if the Conference reports are produced rapidly and are widely available. 


C. C. BUTLER. 


Isotope Separation 


Separation of Isotopes. Edited by H. London. (London: George Newnes, 1961.) 
[Pp. xvi + 488.] 80s. 


Almost as soon as the existence of isotopes of the stable elements was demonstrated, 
physicists began to search for methods by which appreciable quantities of these could be 
separated. There are in fact many physical processes which are slightly affected by the 
differences in isotopic mass and almost all of these have been tried out as methods for 
isotope separation. However all of this was on a fairly academic plane until the wartime 
programme on nuclear energy led to an enormous effort primarily devoted to separating 
the uranium isotopes. The thermally fissile ²³⁵U had to be separated from the non-fissile 
²³⁸U to make the first nuclear weapon and later uranium enriched in ²³⁵U was required 
to fuel high flux reactors. Since in natural uranium the fissile isotope occurs with an 
abundance of only 0·7 per cent. and of course differs in mass by only 1·3 per cent. the 
problem was very difficult indeed. Other requirements of the nuclear energy industry 
for materials of either very high neutron absorption for control purposes or of very low 
neutron absorption for moderation purposes also led to needs for separated isotopes of, 
for example, boron and hydrogen. The problem had then changed from that of producing 
a few micrograms or less of an isotope for nuclear physics studies to that of creating large 
factories for producing kilograms of the required isotope. Before large amounts of money 
and technical effort could be risked on such installations the relative efficiencies and costs 
of many possible systems had to be carefully evaluated. Thus in a few years from being a 
relative by-way in technological science, isotope separation has become a major industrial 
commitment at least for the nuclear powers. The subject now has a large literature and is 
of vital interest to a wide variety of scientific disciplines. 

The present breadth of the subject is well illustrated in a new volume Separation of 
Isotopes, edited by H. London. The field encompasses nuclear physics, thermodynamics, 
low temperatures and chemical engineering amongst other sciences. Dr. London, who is 
an expert in many facets of the subject, contributes the introductory chapters and some 
16 experts have prepared the remaining detailed discussions. These are all British or 
European scientists, although the work on the other side of the Atlantic is fully described. 
The style of the book is in some respects that of a handbook. This is understandable since 
many of the methods were investigated in parallel and have been brought to similar stages 
of development. The basic theory, experimental investigation and practical realization of 
some twenty odd methods of isotope separation are described. It should be noted that 
the preparation of tracer quantities of isotopes such as the radioactive ones is not treated 
at all. Only those methods capable of extension to production of macroscopic quantities 
are included. 

All of the methods depend basically on the small difference in mass of atoms of the 
different isotopes. One broad class of phenomena depend on the fact that in thermal 
equilibrium the light and heavy atoms will have different free energies. This leads at 
once to different diffusion rates, different behaviour in a thermal gradient, different vapour 
pressures and so on. Another broad class of phenomena depends on the fact that some 
chemical reaction rates are slightly different for two atoms of different mass. Another 
class depends more directly on the inertia of the different atoms so that separation is 
achieved by subjecting the atoms to a centrifugal force in a centrifuge. Finally the difference 
in momentum of electrically charged atoms of different mass can be detected in a magnetic 
field. In addition there are a number of other isotopically sensitive phenomena which in 
principle could be used for separation but have so far not been seriously applied. One is 
the shift in spectral lines from radiating atoms resulting from the perturbation of the 
electron orbits by the nuclei of different mass. Another is interesting and efficient but is 
unfortunately only applicable to the helium isotopes. This depends on the fact that near 
absolute zero helium of atomic weight 4 is superfluid while that of mass 3 is not. By 
providing a leak which only a superfluid can penetrate the He⁴ is readily removed from the 
He³. Finally a method mentioned in this book depends on the fact that isotopes in general 
differ not only in mass but also in nuclear spin. Provided this spin is not zero a separation 
which depends on the spin value could in principle be made. 


The technique of fractional distillation is reviewed by T. F. Johns. The small difference 
in boiling points between two liquids of different isotopic composition is typically a small 
fraction of a degree. It is a result of small differences in the free energy of the liquid- 
molecule resulting from different frequencies of the vibrational or rotational molecular 
excited states. For many of these techniques based on small effects the counter-current 
system must be used to multiply up by large factors the basic small separation. The practical 
techniques for achieving this are crucial. Plants for separating light isotopes up to argon 
have been built to use this method. A closely related system depends on differences in the 
solubility of a gas in a liquid solvent but this has not been applied widely as yet. 
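
As a rough illustration of why the counter-current multiplication is so crucial, the following Python sketch estimates the minimum number of ideal stages needed to reach a chosen enrichment from a small elementary factor. It assumes an idealized cascade at total reflux in which each stage multiplies the abundance ratio R = x/(1 - x) by the same factor alpha. The function name and the numerical values (alpha = 1.005, feed 1 per cent., product 90 per cent.) are illustrative assumptions and are not taken from the book under review.

```python
import math

def min_stages(alpha, x_feed, x_product):
    """Minimum number of ideal stages at total reflux (illustrative model).

    Each stage is assumed to multiply the abundance ratio R = x / (1 - x)
    by the elementary separation factor alpha.
    """
    if alpha <= 1.0:
        raise ValueError("alpha must exceed 1")
    r_feed = x_feed / (1.0 - x_feed)
    r_product = x_product / (1.0 - x_product)
    return math.ceil(math.log(r_product / r_feed) / math.log(alpha))

# Hypothetical numbers: an elementary factor of 1.005,
# enriching from 1 per cent. to 90 per cent.
stages = min_stages(1.005, 0.01, 0.90)
print(stages)
```

On these assumed figures well over a thousand stages are required, which suggests why the practical arrangements for multiplying the elementary effect dominate the design of such plants.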


The various applications of chemical exchange to isotope separation are reviewed by 
T. F. Johns, J. Bigeleisen, R. W. McIlroy, H. R. C. Pratt and E. Glueckauf. This system 
has been most applied to separation of the light elements D, C¹³, N¹⁵, O¹⁸ for example. 
Again the effect depends on differences in the free energy because of displacements of the 
vibrational levels but in addition the ratio of the chemical bond strengths in the two 
chemicals must be appropriate. The two chemicals are contacted by the counter-current 
system but there is now the additional problem of returning the chemicals to their original 
form continuously (refluxing). In some cases this can be done by a chemical reaction, but 
in special cases the refluxing can also be accomplished by dissociation under heating. 
In the latter case the technique is called Exchange Distillation. In special cases, so far 
only applicable to the hydrogen isotopes, the direction of the reaction is itself a function of 
temperature. In this case a dual temperature system automatically accomplishes the 
refluxing. Various combinations of gas, vapour and liquid can be employed with quite 
different constructional problems in each case. Closely related are systems going under 
the name of chromatographic methods in which one phase is made to flow through a 
stationary solid or liquid phase. A well-known example is the ion exchange column which 
has been widely used also in tracer quantity separations of radioactive isotopes. Although 
attractive and widely applicable this technique has not been used thus far for large scale 
separations. 

We now turn to the only direct application of the inertia differences. The technique 
of the gas centrifuge is reviewed by W. Groth and K. Beyerle. The system depends on 
introducing the material to be enriched in liquid form into a centrifuge. Vapour is then 
removed from the axis of the centrifuge and the liquid remaining is enriched in the heavier 
isotope. The use of this device has been much discussed recently as a possibly more 
economical way of separating the uranium isotopes. It may prove to be the best system 
if mechanical improvements in the centrifuge continue. 
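
The importance of mechanical improvements can be made plausible with a short calculation. The Python sketch below uses the textbook equilibrium expression for an ideal gas in rigid rotation at uniform temperature, alpha = exp(ΔM v² / 2RT), where v is the peripheral speed. This is a simplified gas-phase model and not necessarily the scheme described in the book; the function name and the operating point (mass difference 3 g/mol, 300 m/s, 300 K) are hypothetical values chosen only for illustration.

```python
import math

R_GAS = 8.314  # molar gas constant, J / (mol K)

def centrifuge_factor(delta_m_kg_per_mol, peripheral_speed, temperature):
    """Equilibrium separation factor between axis and periphery.

    Assumes an ideal gas in rigid rotation at uniform temperature:
    alpha = exp(delta_M * v**2 / (2 * R * T)).
    """
    return math.exp(delta_m_kg_per_mol * peripheral_speed ** 2
                    / (2.0 * R_GAS * temperature))

# Hypothetical operating point: mass difference 3 g/mol,
# peripheral speed 300 m/s, temperature 300 K.
alpha = centrifuge_factor(3.0e-3, 300.0, 300.0)
print(f"{alpha:.3f}")
```

In this model the factor depends on the mass difference rather than the mass ratio, and it grows rapidly with peripheral speed, so that any gain in rotor speed is strongly rewarded.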


Various different techniques are grouped under the name of irreversible statistical 
processes, in contrast to those already mentioned which are reversible. The systems of 
thermal diffusion are reviewed by K. E. Grew. The idea of using convective circulation 
in a gas to induce a relative motion of the isotopic constituents was first used in a practical 
way in the column of Clusius and Dickel. In its simplest form it consists of an electrically- 
heated wire down the centre of a cold cylinder. The lighter component is concentrated 
near the wire and swept up by convection while the heavier component is carried down 
the cold walls. This has been used for the lighter gases in fairly small scale installations. 
Other systems involving diffusion either through a membrane or through another gas are 
surveyed by C. Boorman. The first system is essentially very simple. The lighter molecules 
strike the semi-permeable wall more frequently and so pass through more readily. Large 
scale use of the system involves mainly problems of the exact nature of the membrane. 
It was this system which was finally chosen for large scale production of uranium isotopes 
in the U.S. and the U.K. and very large plants indeed have been built. Diffusion through 
another gas involves rather more complex column structure and has been less used. 
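
The statement that lighter molecules strike the wall more frequently can be put in numbers under simple kinetic-theory assumptions: mean molecular speeds vary as the inverse square root of the molecular mass, so the ideal single-stage factor for flow through fine pores is sqrt(M_heavy / M_light). The sketch below assumes that the working gas is uranium hexafluoride, which is not named in the review, with molar masses of roughly 349 and 352.

```python
import math

def ideal_diffusion_factor(m_light, m_heavy):
    """Ideal effusion separation factor from Graham's law: sqrt(m_heavy / m_light)."""
    return math.sqrt(m_heavy / m_light)

# Assumed molar masses (g/mol) of the two hexafluorides. Fluorine has a single
# stable isotope, so the difference is that of the uranium isotopes alone.
M_LIGHT = 235.0 + 6 * 19.0   # about 349
M_HEAVY = 238.0 + 6 * 19.0   # about 352

alpha = ideal_diffusion_factor(M_LIGHT, M_HEAVY)
print(f"ideal single-stage factor: {alpha:.4f}")
```

A factor so close to unity, even in the ideal case, goes some way to explain why the plants built on this principle are so very large.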

The separation jet technique is discussed by E. W. Becker and in principle is rather 
simple, the lighter component being concentrated in the peripheral region. Although not 
used on a large scale it seems possible that it could compete with gaseous diffusion. T. F. 
Johns discusses molecular distillation in which a liquid is boiled at low pressures and 
condensed on a cold surface. A counter-current system has been devised, but on the whole 
the separations achieved are not very favourable. Ionic migration which depends on the 
relative ionic velocities in an electrolyzed solution was suggested at a very early stage. 
Practical systems for a wide range of isotopes have been devised using both aqueous solu- 
tions and molten salts; H. C. Cole reviews this method. The electrolytic method has been 
used in particular for the separation of heavy hydrogen. P. T. Walker, who discusses the 
subject, points out that the details of the separation process are, surprisingly, still rather 
mysterious. The intriguing possibilities of using biological phenomena for isotope 
separation are summarized by P. J. Syrett. None of these have proved practical though 
it is clear that algae and bacteria do achieve some separation. It is probable that the algae 
effect is diffusion through the membranous cell walls while the bacterial effect is catalysis 
of a chemical exchange reaction. 

R. H. Bernas and M. G. Ranc review the more specific non-statistical methods. 

Selective excitation of a chemical reaction depends on the isotope shift in the spectra. 
Use of this effect is still at an early stage of development and may prove valuable for certain 
elements. The use of magnetic fields to separate individual ions according to their 
momentum dates back to the earliest work of J. J. Thomson, and the original mass- 
spectrograph of Aston; it first demonstrated the existence of isotopes. During the war an 
extension of this technique underwent major developments since at one stage it was a 
major competitor with gaseous diffusion for uranium enrichment. The fantastic installation 
at Oak Ridge was originally used to separate the first pure samples of ²³⁵U but has since 
been devoted to separation of high purity samples of almost all the stable isotopes for 
research purposes. The system is also suitable for separation of gram quantities of highly 
radioactive isotopes and the HERMES separator at Harwell which is essentially built inside 
a glove box has been used to separate samples of the plutonium isotopes. The development 
of the ion sources for such separators has been a critical feature of these advances in 
technique. None of the other methods mentioned can approach the electromagnetic 
separator for preparation of really pure samples though of course the method is very 
inefficient and costly. Other variations of the electromagnetic system have been proposed, 
such as the resonance method in which the cyclotron resonance of the ion in a magnetic 
field is the critical parameter, and the Isotron in which purely electrostatic fields are used. 
Neither method has proven practical. 
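
The principle of the electromagnetic separator can be illustrated by a short calculation. An ion of mass m and charge q accelerated through a potential V follows, in a uniform field B, a circle of radius r = sqrt(2mV/q)/B, so that ions of different mass arrive at slightly different places after half a turn. The sketch below is only indicative: the accelerating voltage (35 kV), the field (0.34 T), the singly charged state and the use of mass numbers in place of exact masses are all assumptions made for illustration and are not figures from the book.

```python
import math

E_CHARGE = 1.602176634e-19    # elementary charge, C
ATOMIC_MASS = 1.66053907e-27  # atomic mass unit, kg

def orbit_radius(mass_number, volts, tesla, charge_state=1):
    """Radius (m) of the circular path of an ion accelerated through `volts`
    and bent in a uniform field `tesla`: r = sqrt(2 m V / q) / B."""
    m = mass_number * ATOMIC_MASS  # mass approximated by the mass number
    q = charge_state * E_CHARGE
    return math.sqrt(2.0 * m * volts / q) / tesla

# Hypothetical operating conditions for singly charged uranium ions.
r235 = orbit_radius(235, 35.0e3, 0.34)
r238 = orbit_radius(238, 35.0e3, 0.34)
print(f"radii {r235:.4f} m and {r238:.4f} m")
print(f"separation after half a turn {2e3 * (r238 - r235):.1f} mm")
```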

This brief resumé of the field illustrates the wide range of technical skills which are 
involved in isotope separation. It would appear that almost all practical methods have been 
reasonably well evaluated by now and that the possibility of a major novel break-through 
is rather unlikely. However, the ideal method for large scale production of low enrichments 
is probably by no means settled and a method which reduces the cost of separation in a gas 
centrifuge, for example, could change the picture overnight. For large-scale production of 
high enrichment material the electromagnetic separator appears to be in an unassailable 
position and to increase the scale of this work involves simply duplication of these very 
costly units. In special cases the production of high enrichment of a specific isotope can 
be achieved following the decay of a radioactive element. This topic is not touched on in 
this book, but for example, almost all He³ is now derived from the decay of tritium so that 
more standard methods of He³ enrichment are not now used. However the clear treatment 
of a wide range of techniques in this volume will guarantee that it remains a useful reference 
work for many years. E. B. PAUL. 


BOOK REVIEWS 


The Inspiration of Science. By SIR GEORGE THOMSON. (Oxford University Press, 1961.) 
[Pp.x+150.] 18s. 


In an attempt to explain, “even to those with little or no physics”, how the research 
scientist thinks, the author first examines what science is trying to do and emphasizes the 
intellectual appeal of the work. An excellent account of the scientific method (which he 
treats as a “guide to the explorer” rather than “a royal road”) is followed by examples 
from the field of physics, tracing the development of the concepts of the atom and its 
nucleus in the early part of this century, relativity and the energy-mass relationship. 
Turning to the matter of experiment, he illustrates how luck, accuracy of measurement and 
tenacity—especially with the trivial—can lead to new discoveries, and, in the final chapter, 
deals with matter waves and the uncertainty principle. To add to the interest, there are 
eleven short biographical sketches of physicists concerned with the developments described 
and known personally to the author, no living physicist being included. 

This is a most readable book and should certainly be recommended to all sixth form 
science specialists. It could well form the stimulating basis of a course in science for the 
non-scientist but the reader with no knowledge of physics would find much of the book 
difficult to follow—in spite of the glossary of technical terms at the end. It is a pity that 
some of the more recent advances and associated problems were not included. 

G. E. FOXCROFT. 


The Third Law of Thermodynamics. By J. WILKS. (Oxford University Press, 1961.) 
[Pp. viii+142.] 15s. 

When your reviewer was taught physics under the late J. K. Roberts at Cambridge in 
the middle 30’s, the Nernst Heat Theorem was applied only to the entropy change in 
chemical reactions at absolute zero, and was not indeed referred to as the third law of 
thermodynamics. Roberts says in one edition of his book that one would not expect the 
Nernst Heat Theorem to apply to liquid helium. In the introduction to this book, the 
author points out that the scope and significance of the theorem have been steadily increased, 
largely by the late Sir Francis Simon, and that the law is now a principle both of physics 
and chemistry. It is very interesting to see how Honours students of both Physics and 
Chemistry can now be taught this as a general principle and not as a particular law relating 
purely to chemical reactions. The author points out that an informed treatment of the 
third law is only possible if entropy is thought of as a statistical as well as a thermodynamic 
function and that the third law discusses the behaviour of the entropy function, whereas 
the second law merely postulates the existence of such a function. 

The first two chapters therefore develop the idea of entropy and probability in a rigorous 
mathematical way, but they clearly relate it to a measure of disorder as well as a straight 
thermodynamic function. The third law is then stated in the form “the contribution to 
the entropy of a system by each aspect which is in internal thermodynamic equilibrium 
tends to zero at absolute zero”. It follows immediately from this that the specific heats 
of all substances tend to zero at absolute zero and so do expansion coefficients. 

The application of the law to systems which are not in thermodynamic equilibrium is 
more complex and is discussed in a special chapter. For a perfect gas, the Boltzmann 
distribution ceases to be valid below a certain characteristic temperature. 

In the next chapter it is shown that in certain special cases, the third law can be derived 
as a consequence of statistical mechanics, but it is not possible to do this for the perfectly 
general case. 

In the ninth chapter the theorem is applied to chemical equilibria which was, of course, 
as mentioned above, its earliest application, while in the tenth chapter, the unattainability 
of the absolute zero is discussed as a consequence of the third law. In general, it can be 
said that the book is very clearly written and the explanations should be adequate for the 
third-year Honours undergraduate, but that, as is inevitable in a developing subject, some 
of the chapters start with a clear statement of a question without being able to give a clear 
answer to it at the end. M. W. THRING. 


Lectures in Theoretical Physics. Vol. III. Edited by WESLEY E. BRITTIN, B. W. DOWNS, 
and JOANNE DOWNS (University of Colorado). (New York and London: Interscience 
Publishers, 1961.) [Pp. 531.] 83s. 


This volume contains the text of the lectures of the 1960 Summer Institute for Theoreti- 
cal Physics at Boulder. The editors and publishers once again have managed to produce 
these proceedings quickly and at a reasonable price, but they have not seen fit to add an 
index to the volume—an oversight which surely need not have delayed the publication or 
increased the price. This volume contains lectures by A. Bohr on Dispersion Relations, 
R. Zwanzig on Irreversibility, B. L. Moiseiwitsch on Scattering, M. Green on Molecular 
Distribution, E. Montroll on Non-equilibrium Statistical Mechanics, R. Haag on Collision 
Processes and Field Theory, R. Balescu on Non-equilibrium Phenomena, N. Ashby on 
Green Function Methods, K. Symanzik on Quantum Field Theory and, finally, V. F. 
Weisskopf on Selected Topics in Theoretical Physics. Most of the lectures are for people 
well qualified in theoretical physics and should not be of great use to anybody who has not 
got at least a first degree but Weisskopf is at his best in discussing various, not necessarily 
related, current topics emphasising the physical contents of the problem and showing with 
a minimum of formulae how the difficulties can be resolved. The topics he discusses are: 
the apparent rotation of objects when seen by a rapidly moving observer, gravity waves, the 
Mössbauer effect, the chemical bond, and a few low energy nuclear physics problems. 

There is no doubt in this reviewer’s mind that this volume will take its place along with 
the previous ones as an extremely useful reference book. D. TER HAAR. 


Thermoelectricity: Science and Engineering. By ROBERT R. HEIKES and ROLAND W. URE 
(Jr.). (Interscience Publishers, 1961.) [Pp. xi+576.] $18.50. 139s. 


Since Ioffe’s book was published in 1956, there have been several books dealing with 
thermoelectricity, but except for Goldsmid’s Methuen Monograph, these have been based 
on conference reports and have not given a systematic treatment of the subject. This new 
book by Heikes and Ure with a number of colleagues, is very comprehensive and meets 
very well the need arising from the growing interest in this subject. The book deals with 
the basic physics and chemistry of semiconductors in general and of thermoelectric 
materials in particular, the design of materials for thermoelectric applications, the properties 
of the most important materials known at the present time, and the technology of thermo- 
electric refrigerators and generators. 

Much of the material in the book has been presented in earlier texts and in published 
papers, but it has not before been coordinated so satisfactorily. There are however several 
items not published previously, in particular on narrow-band semiconductors, radiation 
damage, crystal growth, figure of merit, low temperature cooling, some of the thermoelectric 
materials, and on several aspects of device technology. 

The book is well written and is an authoritative account by numerous members of the 
Westinghouse team of the subject upon which they have been actively and enthusiastically 
engaged for several years. D. A. WRIGHT. 


Dispersion Relations and the Abstract Approach to Field Theory. Edited by LEWIS KLEIN. 
(International Science Review Series, Volume I.) (New York: Gordon and Breach, 1961.) 
[Pp. x+273.] $4.95. 


Recently, the number of physics books which are essentially a collection of separate 
articles has increased greatly, not always with great justification. There are, however, 
reasons why the publication of the present volume can be justified. It brings together a 
number of the original papers dealing with dispersion relations in high energy physics and 
with the so-called abstract approach to field theory. Moreover, those papers which appeared 
originally in German are published here in a translation. In many ways it is of educational 
value to go back to the pioneering papers in a subject, and this is made easier if such papers 
are collected as is done here. From the title of the volume it should be clear that only 
experts can benefit from it, but for them it forms a welcome addition to the steadily growing 
literature on dispersion relations. D. TER HAAR. 


Aurores et Ciel Nocturne. (Centre National de la Recherche Scientifique, Paris, 1961.) 
[Pp. 66.] 


This is a report of French work on aurora and airglow undertaken during the Inter- 
national Geophysical Year 1957-58 and International Geophysical Co-operation 1959. 

Visual observations of aurora were made at meteorological stations, the two French 
Antarctic stations in Terre Adélie and Kerguélen, and at the observatories of Haute Provence 
and Tamanrasset in the southern Sahara, in ships at sea and by voluntary observers. 

The station at Kerguélen, being far separated from other Antarctic auroral stations, is 
therefore important for morphological studies, but unfortunately cloud seriously hampers 
observations. Accordingly, a photoelectric photometer was arranged to operate at 3914 Å 
(N₂⁺ band) and provided a measure of auroral intensity little affected by sky conditions. 
Correction for moonlight is described, tables give auroral intensity on a four point scale for 
each observed quarter hour, and comparison is made with observations at Haute Provence 
of the radiation at 6300 Å (O I). 

The third and last section of the report deals with the measurements of airglow made at 
Haute Provence and Tamanrasset. Three types of photoelectric photometer and the 
calibration of each are described and measurements for various airglow radiations are 
recorded in an extensive series of graphs and tables. 

Lists of references to papers concerned with the analysis of these observations are pro- 
vided at the end of each section and testify to a fine achievement in this field. 

JAMES PATON. 


Hydrodynamics. By GARRETT BIRKHOFF. 2nd Edition. (Princeton : University Press, 
London : Oxford University Press, 1961.) [Pp. xi+184.] 42s. 


This is the second (enlarged) edition of an unusual work which was first published in 
1950. Its full title is ‘ Hydrodynamics, a Study in Logic, Fact and Similitude’ and it is 
necessary to emphasize from the outset that it is not a text book in the usual sense of the 
word. No one could learn hydrodynamics or aerodynamics ab initio from Professor 
Birkhoff’s book, but those who have studied the subject in the usual way, say through the 
works of Ramsey, Lamb or Milne-Thomson, will find from this slim volume that there is 
still much to be explained in the motion of air or water. 

The mathematical treatment of the motion of fluids necessarily demands a great amount 
of idealization, with the result that conclusions drawn by impeccable mathematical reasoning 
from the classical equations sometimes conflict with experimental results with real fluids. 
Early in the history of hydrodynamics, D’Alembert was disturbed to find that he had 
proved that the resistance experienced by a solid of any shape moving with uniform 
velocity through an unbounded frictionless fluid, otherwise at rest, is zero. This result, 
since known as D’Alembert’s Paradox, arises because of the omission of viscosity. 
Professor Birkhoff uses the word ‘ paradox’ to denote any conclusion drawn by formally 
correct mathematical reasoning which is manifestly at variance with the experimental data. 
His paradoxes are thus not like those familiar in mathematical logic, such as the ‘ class of all 
classes’ paradox of Russell. Usually, the discrepancy means that the initial idealization 
is too restrictive. At times the word ‘ paradox’ is given a very extended meaning by 
Professor Birkhoff; for instance, the familiar result that the Poiseuille-Hagen law for viscous 
flow through capillary tubes breaks down for flow at large Reynolds numbers is referred to 
as the ‘ turbulence paradox’. But here is an example of transition from a very simple type 
of motion to a far more complex mode, which as yet has defied exact analysis. This is 
hardly comparable with D’Alembert’s historical result. 
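
The D’Alembert result can be checked numerically in one simple special case. The Python sketch below takes a circular cylinder in steady two-dimensional ideal flow, for which the classical surface pressure coefficient is Cp = 1 - 4 sin²θ, and integrates the streamwise component of the pressure around the surface. The choice of a cylinder, the function name and the number of integration points are illustrative assumptions and are not drawn from Professor Birkhoff’s text.

```python
import math

def pressure_drag_coefficient(n=3600):
    """Drag coefficient of a circular cylinder in ideal (inviscid) flow.

    Uses the classical potential-flow surface pressure Cp = 1 - 4 sin^2(theta)
    and integrates its streamwise component around the surface.
    """
    d_theta = 2.0 * math.pi / n
    total = 0.0
    for k in range(n):
        theta = (k + 0.5) * d_theta  # midpoint rule
        cp = 1.0 - 4.0 * math.sin(theta) ** 2
        total += cp * math.cos(theta) * d_theta
    # Force coefficient referred to the diameter; the sign convention is
    # immaterial here because the integral vanishes.
    return -0.5 * total

cd = pressure_drag_coefficient()
print(abs(cd) < 1e-9)
```

The pressure on the rear half exactly balances that on the front half, so the computed resistance vanishes to rounding error, in plain conflict with experience in real fluids.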

The discussion of the reliability of deductions from the classical equations occupies the 
first two chapters of the book. It is followed by an account of phenomena in jets, cavities 
and wakes, and then by a theoretical examination of the use of models and applications 
of the more sophisticated methods of dimensional analysis. Finally, there are sections 
dealing with group theory in fluid mechanics and a discussion of ‘ added mass ’ (the apparent 
increase in the inertia of a solid caused by acceleration in a fluid medium), topics which cannot 
be adequately discussed in a short review. 


To summarize, this book is not a systematic treatise but a series of essays intended first 
to disturb the complacency (if it exists!) of mathematical physicists and second, to bring 
into fluid mechanics some rather recondite parts of pure mathematics. It is not for arm- 
chair reading but demands the concentrated attention of a reader already versed in the subject. 
It can be recommended to post-graduate students and others who, having read the customary 
texts, are not content to take for granted all that is said in them. O. G. SUTTON. 


Les Recherches Galactiques et Extragalactiques et la Photographie Electronique. (Paris : 
Centre National de la Recherche Scientifique, 1960). [Pp. 161.] 


This volume of 24 papers, presented at a C.N.R.S. conference in Paris in 1959, seems 
at first sight to divide itself somewhat arbitrarily between the astronomy of very large-scale 
objects and, on the other hand, some recent electronic devices for the building of the 
photographic image. But the juxtaposition is not irrational, as extragalactic problems 
depend critically upon the improvement of images and the increase of resolving power and 
spectral selectivity. Names famous in American, Russian, Dutch, French, Belgian, 
Scandinavian and German astronomy are represented ; not many British except M. J. 
Seaton and D. S. Evans. 

Physicist readers will select according to their laboratory interests, but I would empha- 
size the importance to general scientific knowledge of the following papers, numbered 10, 11, 
14 and 23 in the book : 

J. H. Oort and G. van Herk offer an impressive piece of mature and cautious reasoning 
on the evolution of globular star-clusters based on the expulsion of gas by stars approaching 
the White-Dwarf density. 

Vorontsov-Velyaminov presents an atlas of those strange extragalactic objects which 
appear to be linked together by tenuous filaments which may be partly gaseous and partly 
immense stellar aggregates. Gravitational and also magnetic forces are discussed critically. 
It is now doubted whether their first name of ‘ colliding galaxies’ has any rational support 
in the study of the enormous structures. 

Münch connects the analysis of our own galactic structure by 21 cm radio emission 
with the relevant stellar evidence as to rotation of the Milky Way. 

Dufay reports on the infra-red emission and absorption in the central regions of our 
galaxy—one of the really baffling regions of the accessible universe today. 

MARTIN JOHNSON. 


Modern Physics. By D. E. Caro, J. A. McDonell, and B. M. Spicer. (Melbourne : 
F. W. Cheshire, 1961.) [Pp. 222.] 


This book is intended principally for first-year University students, including those 
whose main subjects are biological. The quiet and matter-of-fact style, the obvious care 
which has been taken to obtain accuracy of detail, and the very straightforward mathematics 
used, are fully consistent with such an aim. 

After a brief survey of atomic ideas from ancient times to the 19th century, the authors 
deal with some major discoveries in atomic physics near the turn of the present century. 
They then cover atomic structure as far as the Bohr atom (with a brief mention of later 
developments), and finally outline the progress made in nuclear physics up to the present 
day. 

The choice of subject matter, and its order of presentation, are good, and there is a 
remarkable freedom from errors and misprints. But there is some bias towards experi- 
mental fact at the expense of theory. Wave mechanics is dismissed in a single page ; a 
fuller, if purely qualitative, account seems called for at this level. The treatment of beta 
decay is also open to mild criticism. Thus no distinction is drawn between neutrinos and 
antineutrinos (except in a table) ; and electron capture receives only the briefest mention 
in a footnote. 


The book is well illustrated, and references to original papers appear throughout. There 
is also a useful bibliography. M. S. SMITH. 




Standard and Rapid-Run Magnetograms. Chambon-la-Forét (France). July 1957—March 
1958. (Paris: Centre National de la Recherche Scientifique, 1961). [Pp. 289.] 


Part of France’s contribution to the International Geophysical Year 1957/58 was to 
maintain, and make available the data from, the geophysical observatory at Chambon-la-Forét 
operated by the Institut de Physique du Globe in Paris. The main observational records 
from this station consist of continuous registrations at normal sensitivity and speed 
(15 mm/h) of the three components of the geomagnetic field, horizontal and vertical force 
and declination; a subsidiary set of magnetographs register H and D at greater sensitivity 
and at about five times the normal speed. 

This volume provides reproductions of the actual magnetograms for each day, reduced 
to about 0·4 of the true size, both for the normal and the quick-run records. The repro- 
ductions are clear and sufficient in size to serve their purpose of allowing investigators to 
pick out for special study particular perturbations or features in the constantly changing 
field. At the beginning of each month all necessary details are given for the interpretation 
of the records. 

The volume covers the first half of the I.G.Y.; a second volume will complete the year. 
These two volumes, together with the publication of hourly values and other basic material 
in tabular form by the Institut de Physique du Globe, and a later volume which will discuss 
the morphology of geomagnetic disturbances and particular types of movement, will form 
a most valuable contribution to our knowledge of the earth’s magnetic field in middle 
latitudes. J. M. STAGG. 


The Language of Mathematics. By FRANK LAND. (London: John Murray, 1960.) [Pp. 264.] 


Dr. Land, the author of this volume, is also responsible for Recruits to Teaching, an 
investigation into the mathematical attainment of Training College students, and it was 
this experience, so he tells us, that provided the starting point of his new work. It is 
intended to manifest the advantages of being able to ‘ think mathematically’. The need 
for numeracy in the modern world was recently underlined in the Crowther Report: “ the 
man who is innumerate ” it said “ is cut off from understanding some of the relatively 
new ways in which the human mind is now most busily at work.” The new ways can be 
illumined by the old, and Dr. Land has not hesitated to use the past to direct his readers’ 
attention along some of the vistas which mathematics opens up. There have been several 
attempts earlier, notably Hogben’s Mathematics for the Million, to make mathematics 
available and comprehensible to the many. It is no easy task. This new book may seem 
by the simplicity of its writing and the wealth of its illustrations to make the difficulties 
disappear. 

The first part of the book deals with numbers and how number systems have developed, 
a reminder to many readers of what they were doing when years ago they did long division, 
and also an insight into how in earliest times and later the various notations were adopted 
and changed. There follows an interesting chapter on time and the calendar, and an 
introduction to compound units through illustrations taken from everyday physics. This 
should be as valuable to teachers as it is likely to be fascinating to others. 

Chapter 7 headed Algebra will seem very different from school algebra books of other 
days, and yet in simple language it emphasizes the essentials. Firstly, that every statement 
in algebra must be formulated as a complete sentence, or put another way, that an algebraic 
equation is an expression in symbols of a statement that must first be formulated in ordinary 
language. Secondly, that there are three stages in the solution of any problem; the formu- 
lation of the situation in mathematics, some algebraic manipulation to lead to the ‘ solution ’, 
and finally the translation of the solution back into the original context of the problem. 
Sets, functions, one-to-one correspondence are amongst the terms defined and tellingly 
illustrated, whilst progressions come under the general heading of ‘ logs, pianos and spirals ’. 
Number sequences, such as the squares of the integers, are given geometrical representations 
and examples are taken from many fields, including the Jodrell Bank radio telescope and 
the sizes of animals. 

We are reminded of the disappearance of the old teaching of Euclid and of the changes 
that have taken place in geometrical thinking. The chapter dealing with ‘ shapes and sizes ’ 
introduces similarity and leads on to elementary trigonometrical notions, mensuration, 
maps and the orbits of satellites. It concludes: “A willingness to take out rulers and measure, 
to make judgements about sizes, shapes and proportions and to look critically at such shapes 
as those of smoothing irons, motor cars, houses, office blocks and cathedrals is to be expected 
of intelligent people who take a lively interest in their environment. This should lead to 
appreciation of design and a critical appraisal of both art and industrial products. The active 
interest in shape and size may well provoke enquiry into formal geometrical relationships.” 

The book closes with an account of the Fibonacci sequence and the golden section, 
followed by an introduction to statistics, and the normal distribution, with some comments 
on measurement of intelligence and correlation. 

It is an attractive volume, delightfully designed with excellent illustrations, some of which 
are in colour. The Shell Petroleum Company are to be commended for making possible 
the publication of such an effective contribution to the wider understanding of mathematics. 
It does make clear “ the sort of way in which mathematical thinking can illuminate our 
understanding of some very ordinary things in the world about us ”. 

Dr. Land has also produced a small collection of exercises which should help the sort 
of reader for whom the book is designed. It is published separately so that it might be 
more easily revised from time to time. J. TOPPING. 


Elements of Modern Physics. By PAUL COPELAND and W. E. BENNETT. (Oxford University 
Press, 1961.) [Pp. ix+507.] 15s. 


The sub-title of this volume ‘ A course in those aspects of modern physics that underlie 
important engineering developments ’ is frankly too ambitious for the compass of 400-odd 
pages. In fact this is a book in which a limited selection of topics in ‘ modern’ physics 
is dealt with adequately. The treatment of the subject matter is clear (and that means 
some mathematics), and uncluttered by the detail which a physicist might expect. The 
book is directed towards the enquiring engineer, whose purposes it will suit admirably 
within the limitations of its scope. In the first two parts the authors are mainly concerned 
with the quantized behaviour of matter, especially in the solid state, and they develop some 
of the basic ideas of wave mechanics and quantum statistics to deal with electrons in metals 
and semiconductors. The treatment is of a simple non-rigorous kind, and a great deal of 
solid state theory (e.g. magnetic properties and crystal imperfections) is omitted. The 
qualitative approach is more evident in the third part of the book dealing with nuclear 
physics in which it is sought (quite successfully) to explain the principal ideas without 
recourse to any nuclear theory (surprisingly, pile physics is omitted). Eleven useful mathe- 
matical appendices are included, and at the end of each chapter a list for further reading is 
given. K. I. Mayne. 


Linear Differential Operators. By C. LANCZOS. (D. van Nostrand, 1961.) 
[Pp. xvi+564.] 80s. 


This is an outstanding book, wider in content than the title alone would seem to indicate. 
It is, as the author declares in his preface, ‘ written at an advanced level’ and in the hope 
that it ‘ will stimulate discussions and research at the graduate level’. It is written, again 
the author provides the phrase, in a ‘ discursive manner of presentation ’; a manner that fails 
to attract me personally, but one with many points in its favour when it comes to showing 
the relation of one part of mathematics with another and to stressing the ideas behind the 
mathematical manipulations. 

The first three chapters are introductory, but already cover much ground in the general 
problems of interpolation, harmonic analysis and matrix calculus. Then comes a chapter 
on function space followed by chapters on Green’s function, communication problems, 
Sturm-Liouville problems, boundary-value problems and the numerical solution of 
trajectory problems: the whole done with massive scholarship and detail. 

As a book to ‘ read ’, I judge it to be very demanding: as a book to be studied intensively 
in particular chapters and for the purpose of applying the theory to further problems, I 
judge that it might well prove very stimulating: and as a book of reference for the scientist 
likely to need advanced mathematics in his work, I judge it to be of considerable weight 
and importance. The printing is excellent. W. L. Ferrar. 




Climatic Data from Dumont D’Urville (April 1956—December 1958). (Paris: Centre 
National de la Recherche Scientifique, 1961). [Pp. 169.] 


The International Geophysical Year (and a half) 1957/8 was a wholly exceptional time 
in the Antarctic. Nearly fifty stations on or around the continent maintained continuous 
observations of the meteorological or other geophysical elements throughout the period, 
some even starting a few months before the official starting date of 1 July 1957. This 
volume summarizes the meteorological activities of one of the more completely and finely 
equipped of all these stations, Dumont D’Urville on the Adelie Land coast; and the French 
National Centre for Research is much to be congratulated in producing the volume so 
expeditiously and with such a stimulating mixture of text, photographs, diagrams and tables. 

It had been agreed among the nations that the detailed meteorological observations for 
all geophysical year stations should be reproduced on microcards by the World Meteorological 
Organization and many countries are likely to limit the publication of their material to this. 
France, however, has decided to produce in addition a volume giving information about the 
equipment used for the observations of each meteorological element, both surface and upper 
air, the likely sources of error and suggestions for improvement in future similar work. 
This valuable information from Dumont D’Urville is given in the first part of the present 
volume, and the second part gives a most useful summary in a series of tables of means, 
extremes, etc., of the observations of the surface weather elements (pressure, temperature, 
wind, visibility and so on) as sent to the W.M.O. A third section does the same for the 
upper air data collected by radio sounding balloons in most difficult working con- 
ditions. The whole compendium forms a most impressive and important addition, in 
clear and workmanlike form, to the meteorology of the Antarctic continent: it is much to 
be hoped that many more countries will follow France’s example in publishing a volume 
on this pattern and of this standard for their Antarctic and other Geophysical Year stations. 

J. M. STAGG. 


Introduction to Elementary Particle Physics. By R. R. MARSHAK and E. C. G. SUDARSHAN. 
(Interscience Tracts on Physics and Astronomy.) (New York and London: Interscience 
Publishers Inc., 1961.) [Pp. viii+231.] Paper, 19s; Cloth, 34s. 


This is an excellent little book written by two experts in the field of elementary particle 
physics. Its intention is to provide an introduction for the non-expert to the more per- 
manent features of theoretical high energy physics. Thus it does not concern itself with 
detailed dynamical calculations which, in any case, can rarely be carried out with confidence, 
but only with the experimentally well-established invariance properties of elementary 
particle interactions. 

Assuming a knowledge of basic quantum mechanics the authors first present the main 
ideas of relativistic quantum mechanics and quantum field theory. They then discuss the 
transformations associated with space inversion (P), time reversal (T) and charge con- 
jugation (C) with particular reference to the famous CPT theorem. Next, a treatment is 
given of the additive conservation laws such as charge, baryon, lepton, strangeness and 
hypercharge conservation, and finally a chapter deals with the invariance properties 
associated with isospin. 

A valuable feature of the book is that the reader is never left for long with abstract 
mathematical formalism but is continually brought down to earth by experimental examples 
illustrating the importance and various applications of the different conservation laws or 
invariance principles. This reviewer has nothing but praise for this interesting, well- 
written and very valuable tract which so admirably fulfils its stated intention. 
R. J. BLIN-STOYLE. 


Electronics. A Bibliographical Guide. By C. K. MOORE and K. J. SPENCER. (London: 
Macdonald, 1961.) [Pp. xvii+411.] 65s. 

This guide aims at providing easier access to the literature on electronics published 
before June 1959. The first section of 40 pp. gives details of all sources of information on 
electronics or related subjects—data books, histories, information services, periodicals, 
dictionaries, etc. The remainder deals in turn with 67 different sub-sections of electronics. 
A few titles selected at random are radio astronomy, automation, infra-red, radio-com- 
munication, biological effects, circuit theory, electrical contacts, semiconductor physics, 
cathode ray oscilloscopes, amplifiers, computers. Under each section there are listed 
books, survey articles and specialist papers. These are chosen for their own content and 
also for their value as references to other sources. Such a selection must be somewhat 
arbitrary and it would be easy to criticise some of the inclusions and omissions. However, 
a few checks on subjects familiar to the reviewer revealed that selection appears to have been 
made with care and discrimination. It is unlikely that this book will provide the specialist 
with new information on his own subject but it will be of great value to him as a ready 
source of information on other branches of electronics. It deserves a place on the shelves 
of all libraries serving workers in the electronics field. The publishers and the authors are 
to be congratulated on this venture; it is hoped that it will prove successful and justify 
frequent supplements to keep it up-to-date. M. R. GAVIN. 


Proceedings of the International Symposium on Polarization Phenomena of Nucleons. 
(Birkhauser Verlag, Basle and Stuttgart, 1961.) [Pp. 436.] Fr. 65—(DM 65). 


Nuclei are built up from neutrons and protons (nucleons) each one having an intrinsic spin 
and one of the major problems in the field of nuclear physics is to obtain as much information 
as possible about the nature of the nuclear force between these particles. The force between 
two nucleons is of the order of a hundred times stronger than the ordinary Coulomb force and, 
unlike the Coulomb force, depends not only on the separation between particles but also 
on the orientation of their spins with respect to each other and also with respect to the vectors 
defined by their relative separation and relative momentum. The nuclear force is clearly 
extremely complicated. 

In order to investigate the way in which this force depends on the nucleon spins it is 
necessary to study nuclear reactions and scattering phenomena using beams of particles in 
which the particle spins are mostly pointing in one direction only, rather than in random 
directions as is normally the case. Such a beam of particles is said to be polarized. 

In the summer of 1960 a symposium was held at Basle under the presidency of Professor 
P. Huber at which all the experimental and theoretical ramifications of this subject were 
considered. The discussion ranged over the production of polarized beams and targets, 
the measurement of polarization, nuclear reactions and scattering involving polarized 
particles and the theoretical aspects of these various topics. Most experts in this field 
were present and this volume of the Proceedings of the Symposium gives a full report of 
the various contributions and represents a valuable and up-to-date account of the tremendous 
effort which is being expended in trying to tie down as firmly as possible the spin dependence 
of the nucleon-nucleon force. R. J. BLin-STOYLE. 


International Dictionary of Physics and Electronics (Second Edition). (D. van Nostrand 
and Co., 1961.) [Pp. 1355.] $27.85. 


The new second edition of this monumental work is up-to-date and comprehensive. 
It opens with a 36-page introductory survey of modern physics, and ends with glossaries 
in German, Russian, French, and Spanish. The physics articles are excellent, authoritative 
and instructive, and have nobly kept pace with all that has happened during the five years 
since the first edition appeared. The mathematical ones may be useful to those who have 
to read the more succinct kind of physical literature. Those on electronics seem, as I 
suppose they must to the outsider, to translate technical terms into jargon, or vice versa. 
There is a great deal of cross-reference, and a rough estimate suggests that entries like 
“ Potential, critical—see Critical potential ” must account for a hundred pages or so out of 
the total of over 1300. It might be worth while to streamline future editions, on the as- 
sumption that readers of a book such as this can find what they want without playing 
snakes-and-ladders. 


A valuable book for the reference library. G. R. Noakes. 




Shock Tubes. By J. K. WRIGHT. (Methuen’s Monographs on Physical Subjects.) 
(London: Methuen and Co., 1961.) [Pp. viii+164.] 13s. 6d. 


Shock tubes have been widely used in the last fifteen years, and books on the subject 
are now beginning to appear. There have been some excellent laboratory reports on 
operational techniques but this monograph is the first publication which can be recom- 
mended as a textbook to an undergraduate or research worker. An introductory chapter 
on fluid dynamics and the development of shock waves is followed by an outline of the 
theory of shock tube flow and a discussion on measurement techniques. The remainder 
of the book describes the experimental application of shock tubes to problems in aero- 
dynamics, chemistry and physics. 

Large-scale pressure driven shock tubes are used in aerodynamics to study the pheno- 
mena of hypersonic flow and in chemistry to measure rates of reaction and dissociation. 
Interest in plasma physics and thermo-nuclear fusion has led to the use of the heating 
properties of strong shock waves and to the development of small-scale electromagnetic 
shock tubes where the driving force is supplied by a magnetic piston. 

This book covers all of these topics in some detail, and is an addition to the author’s 
list of valuable contributions to the literature of the subject. 

H. J. Pain. 


The Abundance of the Elements. By LAWRENCE H. ALLER. (New York: Interscience 
Publishers, 1961.) [Pp. xi+283.] $10.0 


It is only in very recent years that the problem of the relative abundance of the chemical 
elements up and down the universe has drawn upon a sufficient variety of sciences to be 
worthy of a comprehensive treatise of readable character. The present enterprise by Aller, 
the well-known Michigan leader of stellar and other astronomical research projects, will be 
a stimulus to several fields in physics that are due for urgent exploration. 

It is, of course, well-known that stellar spectra dominated by lines of hydrogen or helium, 
or metal or oxide, do not primarily imply a star ‘ made of’ those materials, but rather a 
range of surface temperature and gas pressure capable of ensuring concentration of particular 
electron levels; for example there is a restricted range for the 2-quantum state of atomic 
hydrogen, outside which even a star ‘ made of’ hydrogen would be too over-ionized or too 
under-excited to exhibit strong Balmer lines. But in recent years, second-order analyses 
of stellar spectra have enabled astronomers to distinguish from the dominant results of 
temperature and pressure the effects attributable to variations in actual relative abundance 
of the elements. This topic is shrewdly and clearly developed here by Aller. 

Terrestrial abundances from geophysics, and evidence of unexpected relative abun- 
dances in meteorites, are also discussed. A problem of fundamental importance, neither 
here nor anywhere yet adequately solved, is the apparently different distribution of atomic 
species in cosmic rays compared with solar and stellar as well as terrestrial sources: Aller is 
wisely critical of the very scattered types of evidence. In a final chapter on the genesis of 
the elements, we have a glimpse into that overlap of nuclear physics and theories of stellar 
evolution which is so fascinating but hazardous. Aller mentions even the clue which sug- 
gests that magnetic stars show nuclear reactions in their atmospheres, not merely in the deep 
interior. MARTIN JOHNSON. 


Magnetisme Terrestre. Magnetogrammes Normaux et Semi-rapides, Chambon-la-Forét 
(April—December 1958). (Paris: CNRS, 1961.) [Pp. 287.] 35 NF. 


An earlier volume (Series III, Fascicule 2a) presented reproductions of the magneto- 
grams recorded daily at the French geomagnetic station at Chambon-la-Forét during the 
first half of the I.G.Y. This volume completes the project by giving reduced facsimiles 
of the normal and quick-run records for each day from April to December 1958. All the 
necessary information as regards sense of movement, scale and base-line values and time 
breaks is given for detailed interpretation of the earth’s magnetic field as recorded at the 
station. J. M. STAGG. 




Progress in Optics Volume I. Edited by E. WOLF. (North-Holland Publishing Co., 1961.) 
[Pp. xii+342.] 70s. 
This new series, Progress in Optics, provides review articles on current researches in 
optics and related fields. The first volume contains eight articles dealing with Hamiltonian 
optics, optical design, aberration-free diffraction images, information theory, interference 
colour, visual processes and alignment devices. It is pleasant indeed to record that this book 
achieves that rarely attained objective of being both readable and informative without 
requiring the reader to be an expert on the topics discussed. 

Perhaps the highlight of the volume is Professor Gabor’s paper which is the substance of 
his 1951 Ritchie Lecture—a much quoted contribution to light and information—which 
had hitherto remained unpublished. 

Some very careful thought has obviously been given to the selection and arrangement 
of the topics with the result that the volume is much more than just a collection of individual 
papers bound in a single cover. If future volumes maintain this high standard (and under 
Professor Wolf’s editorship one has every confidence that this will be so) this series will 
become of paramount importance not only to those actively engaged in optical research but 
also to those who have a general interest in the subject. 

The price is that which we have resigned ourselves to accept for this type of publication 
but there is certainly value for money. The first volume can be recommended not only 
for the library shelves but also for personal use. B. J. THOMPSON. 


A Treatise on Analytical Chemistry, Part II, Vol. 3. Analytical Chemistry of the Elements. 
Edited by I. M. KOLTHOFF and P. J. ELVING, assisted by E. B. SANDELL. (Interscience 
Publishers Inc., 1961.) [Pp. xvii+380.] 100s. 


This volume is part of a set, not completely specified, intended to present a ‘ concise, 
comprehensive and systematic treatment of all aspects of analytical chemistry’. Part I 
deals with the general theory and practice of analytical work and is to be followed in Part II 
by a systematic review of the analytical chemistry of each of the elements. Section A of 
Part II is devoted to the analysis of the inorganic compounds of the elements arranged on 
the basis of the Periodic Table. It should be noted that the claim of systematic presentation 
has now been cancelled on the plea of minimizing delay in publication. Section B of Part II 
will deal with the determination of elements in organic compounds, and Part III will discuss 
the analysis of industrial products. 

The newly issued work is Volume 3 of Part IIA and deals with the elements cadmium, 
copper, magnesium, mercury, tin and zinc. A short summary of the occurrence of the 
element, the industrial methods of extraction and its toxicology is followed by useful 
discussions on the sampling of ores and related compounds. An extensive treatment of the 
various methods of qualitative and quantitative determinations of the element and its more 
important compounds forms the major part of each section. Specialists in the study of 
particular elements were invited to compile the individual sections ; this has led to some 
variations in the method of treatment. 

The completed set will form a useful addition to University libraries; but can a work be 
said to be comprehensive when another multi-volume set on Analytical Chemistry by the 
same authors is advertised on the dust cover? It took the reviewer some time to realize 
the two sets of works were different. W. SIDDALL. 


Physics for Electrical Engineers. By W. P. JOLLY. (English Universities Press, 1961.) 
[Pp. xii+308.] 21s. 


In his preface the author states that the “ aim of the book is to provide a coherent 
account of the nature of matter and energy which is fairly simple, without being too super- 
ficial. But... it must provide the right material and worked problems to ensure that the 
reader should be able to pass the Engineering Physics examination of the I.E.E. ...” 
The author is a teacher of experience and one must respect his intentions. Unfortunately 
physics is not simple and there is little point in pretending that it is. Physical concepts 
(even some quite elementary ones) present a challenge to the imagination that is altogether 
missing from this book. iE \Weanlene. 




National Physical Laboratory. Notes on Applied Science. No. 23. Radiometric Standards 
and Measurements, [pp. 23]; 1s. 9d. No. 24. Photometric Standards and the Unit 
of Light, [pp. 32]; 3s. 


The first of these short monographs describes the methods used at the NPL over the 
range 0·25-0·74. These cover: (1) The measurement of total radiation using calibrated 
thermopiles of the Moll or Schwarz type and Callendar’s radio-balance. (2) Mono- 
chromatic measurements, and the calibration of a photocell against a thermopile. (3) The 
absolute measurement of radiation using the Guild drift radiometer, in which a system of 
fairly large thermal capacity is exposed successively to the radiation and to electrical 
heating, and the latter adjusted to give the same rate of temperature rise. 

The second monograph deals with the establishment of the primary standard, a full 
radiator at the temperature of solidification of platinum which has a luminance of 60 
candelas/sq. cm., and with the methods employed to calibrate secondary standards in terms 
of this. Some interesting elementary points are raised—that, for example, the initial 
comparison with the primary standard is made in terms of point-source luminous intensity, 
while in practice many sources (particularly gas discharge lamps) are so extensive that the 
only measurable is the integrated luminous flux. So two sets of secondary standards, one 
for luminous intensity embodying the candela (colour temperature 2042°K and 2353°K) and 
one for flux embodying the lumen (colour temperature 2353°K and 2788°K) are maintained. 
The Lummer-Brodhun photometer is used for some visual comparisons with the primary 
standard; at least, I think so. On page 7 it is stated that visual methods have long since 
been replaced entirely by photoelectric ones, and there is a reference to the demise of visual 
photometry; on page 11, the use of the Lummer-Brodhun is described, and a diagram of 
the process is given. It is rather ambiguous; one would like to know whether the old text- 
book friend is really obsolete or not. 

The appendix suggests the possible future trend of photometry—an eye for an eye, as 
it were—using photocells with spectral response matching that of the eye, and calibrated 
absolutely by the methods described in the first monograph. G. R. Noakes. 


Météorologie: Données climatiques à la station Charcot et à l’intérieur de la Terre Adélie
(1957-1958). (Paris: C.N.R.S., 1961.) [Pp. 354.] 10 NF.


In addition to maintaining an important I.G.Y. base at Dumont d’Urville on the Adélie
Coast of Antarctica, France occupied a station for meteorological observations at Charcot
300 km inland, in latitude 69°S and 2400 m up on the plateau. This volume gives an account
of the programme of work carried out, the equipment used and some notes on difficulties
encountered in using it. In the form of tables and diagrams without discussion it presents
a comprehensive summary of the information collected on the main surface meteorological
elements and from pilot and sounding balloons. There is a section on the observations
made on the journeys between the base and Charcot with an extension southward to 71°S.

Like the parallel volume on the meteorological activities at Dumont d’Urville, this 
Charcot account gives all its information in an enticingly compact and clear form with no 
elaborations or embellishments. It is only from a scrutiny of the tables that the reader 
gets some appreciation of the rigour and severity of the conditions in which the observations 
were made. Throughout the period of occupation of the station from April 1957 to December
1958 the monthly mean temperature never exceeded −21°C, and every month except
February 1958 had many days when winds greater than 30 kt were recorded. In June 1958
the average temperature was −50°C and the average wind speed was 23 kt. In such
conditions it is no wonder that defects developed in many of the instruments and modifica- 
tions had to be made or replacements introduced in order to maintain the high standard of 
accuracy set for the observations. The experience gained should lead to improvements 
for similar work in the future, though it is astonishing that so many of the difficulties en- 
countered at Charcot are exactly those that have been noted by earlier expeditions in similar 
though perhaps not quite so extreme conditions. 

The volume is published by the French National Centre for Scientific Research as 
Fascicule 2 of Series II: each of these valuable volumes should carry a statement to indicate 
what the various series and fascicules, already published and planned, contain. 


J. M. STAGG.




Properties of Matter (Second Edition). By H. N. V. TEMPERLEY. (University Tutorial 

Press, 1961.) [Pp. vii+329.] 21s. 

The first edition of this book appeared in 1953 and will be familiar to many readers.
In this second edition there is a new section on ultrasonics and an additional chapter on 
space research. Both additions are very brief and do not add significantly to the subject 
matter of the original. Although written with the needs of University students in mind 
the scope of this book is such as to make certain sections suitable for candidates for the 
Advanced and Scholarship Level Examination. 

It is the reviewer’s opinion that this is one of the most excellent books of its kind to have 
appeared in print. All too often this subject is presented in the form of a list of applications 
of Newton’s Laws of Motion together with some rather special empirical laws which 
describe the behaviour of certain types of materials. It is hardly surprising that students 
often find this desiccated skeleton uninteresting and are led to dismiss the subject as a bore 
without really discovering the first thing about it. Dr. Temperley shows them just what 
they are missing and throughout this book one is constantly being made aware of the 
problems which still remain to be solved and the way they are being tackled. As a result 
we have a book which is not merely informative (for all the standard derivations are to be 
found here) but one which is also fascinating to read. 

The printing is a little old-fashioned but this is an excellent book and wonderful value 
for money. E. W. LEE. 


Plasmas and Controlled Fusion. By DAVID J. ROSE and MELVILLE CLARK, JR. (New York:
M.I.T. Press and J. Wiley & Sons, 1961.) [Pp. 493.] 81s. 


This is a textbook which is used at M.I.T. in a one year course for graduates of physics 
and engineering. Except for a short survey of ordinary and nuclear fuels, the first half of 
the book is concerned with the components, motion and general properties of a plasma 
whereas the second half deals with applied plasma physics and fusion machine studies. 

The main chapters cover collisions between particles including the relevant cross 
sections, some surface effects and the yields of secondary emissions, distribution functions, 
diffusion processes and the rates of fusion reactions. The chapters on Maxwell’s theory 
and hydrodynamics lead to the treatment of the bulk motion of a plasma and the different 
types of waves pervading it. Next screening, scattering, relaxation times, resistivity and 
runaway electrons are briefly dealt with. Radiation and other losses, plasma stability and 
materials for thermo-nuclear reactors are then discussed and finally special problems in 
apparatus using pinch and mirror effects and stellarator geometry.

Because of the scope of the book the choice of topics and the appropriate depth of treat- 
ment must have given the authors some headaches. This may explain why the emphasis 
in certain chapters was put more on technical data than on physics. In spite of this the 
presentation is on the whole exceptionally clear and contains much original thought. 
Numerical problems are given at the end of each chapter as well as an extensive bibliography. 
Yet, throughout the book looms the thermonuclear fusion ghost hotly pursued by the 
superintendent of the Hades Science Library Inc. The apparently high standard in 
classics and literature of the physics graduates is shown by numerous quotations from 
Ben Jonson’s Alchemist and Leonardo da Vinci’s Anatomia. The parting shot is Aristotle’s 


ἀνάγκη στῆναι. A. VON ENGEL.


Ripple Tank Studies of Wave Motion. By W. LLOWARCH. (Oxford University Press, 1961.)
[Pp. vi+66.] 15s.

This book gives an excellent account of the use of a ripple tank in the study of wave
motion. After discussing the usual shadow technique and its limitations, the author gives
an account of his use of the schlieren technique for observing the ripples; the advantages
of the schlieren method are explained. There follows an account of the use of the ripple
tank to demonstrate reflection, refraction, interference, and diffraction. The passage of light 
through lenses and prisms is simulated by using troughs of carbon tetrachloride within the 
main water tank. The diversity of the applications can be illustrated by noting that among
the many phenomena successfully demonstrated there are the introduction of wavefront 
aberration by a simple lens, the formation of interference fringes with a biprism, and the 
change of phase that occurs as a converging wave passes through a focus. The text is 


accompanied by over 60 excellent photographs. R. S. LONGHURST.




An Introduction to Magneto Fluid Mechanics. By V. C. A. FERRARO and C. PLUMPTON.
(O.U.P., 1961.) [Pp. viii+181.] 25s.

This book is essentially based on a series of lectures which the senior author has given 
at the Summer School in Varenna and other places. The first section is an excellent 
resumé of earlier work in astrophysics and allied fields. Some scientists may take exception 
because credit is not given to discoverers of physical phenomena, as for instance the pinch
effect and plasma oscillations, but rather to theoreticians for the most elegant, lucid and 
complete mathematical treatments. (‘ J.J.’ would certainly have resented his name being 
frequently misspelled.) 

The first half of the book deals with magneto-hydrodynamics and in particular with the 
theory of a conducting fluid in a magnetic field, the magnetic Reynolds number, interactions 
and forces, and the stability of various configurations. It ends with two pages on practical 
applications. Subsequent sections contain treatments of the various types of continuous 
wave turbulence and shock wave. Then follows a discussion of the motion of charged 
particles in magnetic fields; and of the dynamics of a plasma, which means the velocity 
distribution, the relaxation time to reach the steady state, the pressure tensor in a gas, effects 
of collisions, electric and thermal conduction and similar problems. This section of the 
book, which follows the lines of Spitzer’s monograph, is very short indeed. A chapter on
electrostatic, electromagnetic and hydrodynamic waves concludes the treatise. It is inter- 
esting to note that large mathematical guns are trained on these problems but that in the 
final phase of the battle small arms fire is used. 

A bibliography includes the outstanding theoretical papers published during the last 
ten years as well as useful remarks about some of them. 

Though purists may object to the change from M.K.S. units in part I to Gaussian units 
in part II, the book can be recommended strongly to all those who have some knowledge of 
vector analysis, particularly to theoretical physicists and scientific engineers. 

A. Von ENGEL. 


Science Masters’ Book, Series IV, Part I—Physics. (John Murray, 1961.) [Pp. xiv+281.] 
Diss 

This book follows the pattern of its predecessors in grouping for easy reference some of 
the more valuable experiments and demonstrations from Nos. 101 to 142 of the School 
Science Review. The standard is well maintained, and while selection is always an invidious 
task, there seem to be few unnecessary omissions and little duplication. At one point the 
use of a lethal high voltage source is advocated; the non-lethal RF generator described in a 
later experiment is more satisfactory, and provides a useful, safe and convenient source of 
high potential. It is a pity, too, that some of the practical difficulties which arise are not 
considered more fully: many of the experiments are a good deal less straightforward than
the accounts indicate. There are a few cases in which results are treated badly, and in 
three of the experiments the theoretical basis is doubtful; only two minor typographical 
errors were noted. 

The section on plastics should prove particularly useful, and the grouping of subject 
matter according to the level for which it is intended is more convenient than the arrange- 
ment used formerly. Altogether this is a most satisfactory book, which should be available 
in every school. R. W. Moore. 


Dispersion Relations and Elementary Particles. Edited by CÉCILE DE WITT. (John Wiley

& Sons, 1961.) [Pp. 671.] 160s. 

Proceedings of conferences and summer schools devoted to the fast expanding subject of 
dispersion relations are coming fast and furious from the presses, and any student wishing 
to get acquainted with the subject—which can be traced back to a paper in 1871 by Sellmeier 
—has a wide choice of introductions at various levels. The present volume gives an admir- 
able survey of the field as of the summer of 1960, and will undoubtedly and deservedly find 
its place on the shelves of physics libraries. The eminence of the contributors ensures 
the high standard for which the Proceedings of the annual Les Houches Summer Schools 
are renowned. However, is it necessary to price the book out of the libraries of all individual 
research workers? Moreover, why should a book which is advertised in the United States at 


$20 cost £8 in Great Britain? And why is it impossible at this price to provide an index? 
D. TER HAAR.




Theory of Direct Nuclear Reactions. By W. TOBOCMAN. (Oxford University Press, 1961.)
[Ppa d0saie ss. 


Nuclear reactions can be divided roughly into two classes—those in which all the nuclear 
constituents play an active part and those in which only one or two participate directly, the 
remainder providing a passive but important background potential field. It is with reactions 
of this latter class—the so-called direct reactions—that this little book is concerned. 

Professor Tobocman has been actively engaged in research in this field for many years 
and in this book he gives an unpretentious and clear account of the theoretical treatment of 
such reactions, showing how to calculate angular distributions, cross sections, polarizations 
and angular correlations for the various types of direct reaction. His treatment is in terms 
of the distorted wave Born approximation but he does give a brief discussion of other 
approximation methods. 

The book requires of its reader a sound knowledge of quantum mechanical scattering 
theory and is eminently suitable not only for the theoretical graduate student embarking 
on research in this field, but also for the experimenter concerned with the interpretation 
of his experiments in terms of standard theory. Typographically, the O.U.P. have made
an excellent job of some extremely complicated formulae, although in this review copy the 
type on one side of a number of pages was clearly visible from the other, leading to no little 
confusion. R. J. BLIN-STOYLE.


Nuclear Power Today and Tomorrow. By KENNETH JAY. (Methuen, 1961.) [Pp. 270.]
DS. 


This book is concerned with the principles and practice of nuclear reactors. It starts 
with an outline of the relevant concepts of nuclear physics, and then deals with the various 
types of nuclear reactor—first in general terms, and then in detail. The economic 
aspects of nuclear power are also amply covered. 

The work is authoritative, and the style lucid. Clear diagrams, and a large number of 
plates, add to its interest and intelligibility. 

According to the foreword, the book is “... intended to give the layman a better
understanding of the nature of nuclear power and to explain some of the major problems 
which have to be overcome in making practical use of it.” This is perhaps a little opti- 
mistic: for although the treatment is non-mathematical, there is enough technicality to 
cause a layman to have to put considerable effort into following it in detail. The book is 
more likely to be of value to readers who are reasonably grounded in the elements of physics 
(excluding nuclear-reactor engineering), and who are interested in obtaining a fairly detailed 
non-specialist knowledge of the important subject of nuclear power. 

The book ought therefore to be made widely available in libraries, including those in 
schools and technical colleges. M. S. SMITH. 




Kinematics of Nuclear Reactions. By A. BALDIN, V. I. GOLDANSKII and I. L. ROZENTAL.
Translated by R. F. PEIERLS. (Oxford University Press, 1961.) [Pp. 223 + 57 tables.]
38s. 


Any translation of a modern Russian textbook is to be widely welcomed, as the Iron
Curtain too often hinders the complete exchange and cross-fertilization of ideas, nowadays
mainly because of the language barrier, as it is becoming increasingly easy to obtain Russian
books in the West. The present translation of a book which was published as recently as
1959 is the more welcome as it is by somebody who is well acquainted not only with the
language of the original, but also with the subject matter. The book is concerned with those
relations between momenta, energies, and directions of emission of the particles produced
in nuclear reactions which can be deduced from classical and quantum-mechanical conservation
laws. The text is supplemented with useful tables of Clebsch-Gordan and Racah
coefficients, and of other coefficients occurring in expressions for reaction cross sections, and
also with a large number of nomograms giving relations between energies and angle of
emergence for a number of reactions, and similar relations. This volume should prove to
be useful as a handy reference book, but surely a publisher of the standing of the Oxford
University Press should have asked the translator to provide an index! D. TER HAAR.


Electrons in Metals: A Short Guide to the Fermi Surface 


by J. M. ZIMAN 
University Lecturer in Physics, and Fellow of King’s College, Cambridge. 


Part II. BANDS AND ZONES


1. DEFICIENCIES OF THE FREE-ELECTRON MODEL 


A gas of free electrons in jellium is an excellent model for the electrical 
properties of metals. Indeed, it is so good, and so easy to use in practice, that 
extensive calculations have been made of all sorts of properties in addition to 
those which were discussed in I, and the same sort of semiquantitative agreement 
with experiment is found. 

But really it is a bit bogus. How can we ignore the field of the ions—a field 
which is locally so strong that it would bind an electron in a free atom? Surely 
the conduction electrons must be scattered by the ions. How is it possible that 
this is not observed as a large constant electrical resistance? Then there are 
the experimental anomalies, such as positive Hall coefficients and positive thermo- 
electrical powers, which can by no means be explained within the free-electron 
model. 

It is essential that we deal with the problem of the motion of an electron in 
the field of the lattice of ions. This seems a formidable task, which certainly
it is if one wants to construct a programme that will print out values of the 
electrical conductivity and thermo-electric power of a metal, having fed into the 
computer only the potential field of each ion and the arrangement of the ions in 
the crystal lattice. It is only in the last few years that such direct quantitative 
calculations have produced results that are recognizably like the observed 
properties. But the formal structure of the theory has been well understood for 
the past 30 years, and it provides, in principle, answers to all the problems 
which I have just mentioned. In parts II and III, I shall try to explain this 
subtle and elegant theory, and to indicate the physical significance of its results. 


2. WAVE-PROPAGATION IN A LATTICE 


In our free-electron system, the electrons are represented quantum- 
mechanically by simple plane waves. Let us suppose that the atomic potentials 
are small, just to see their general effect on the wave functions. We might 
assume for example, that each ion is a small sphere, inside which the potential 
is constant, but a bit different from its value in the free space between the spheres. 
Each sphere will then scatter the electrons according to well known laws of 
diffraction theory. 

But the spheres are arranged on a regular lattice—for the sake of argument, 
on a simple cubic array. The scattered waves from different atoms will tend to
interfere with one another in a systematic way, so that there may be coherence 


in the total diffracted wave. This is precisely the same situation as for the 






Fig. 12. (a) Scattering of electrons by a single atom. (b) Diffraction by a regular array of atoms.


diffraction of x-rays, or fast electrons, by the crystal. It is well known that an 
x-ray beam will usually pass right through a crystal unless it has the wavelength 
corresponding to Bragg reflection, i.e. such that (see fig. 13) 


$$n\lambda = 2a\cos\theta \qquad (1)$$


Fig. 13. Geometry of Bragg reflection. 


For our electrons the very same law must hold. Using the de Broglie formula 
for the wavelength in terms of the momentum, we can write the condition in 
the form 

$$p\,a\cos\theta = n\pi\hbar. \qquad (2)$$


Any electron travelling in the direction $\theta$ relative to the lattice planes, with
momentum p satisfying this condition, will be coherently reflected by the crystal 
lattice. For such an electron a simple travelling wave cannot possibly be a good 
representation of the wave function of a stationary state, for this electron will be 
continuously suffering internal reflections at the lattice planes. 
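
As a rough numerical illustration of condition (2), the sketch below computes the momentum, the de Broglie wavelength and the free-electron kinetic energy at which the first Bragg reflection occurs. The lattice spacing of 3 Å is an assumed, purely illustrative value, and the function names are hypothetical; the angle is the $\theta$ of eq. (2).

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # electron mass, kg
EV = 1.602176634e-19     # joules per electron-volt


def bragg_momentum(a, theta=0.0, n=1):
    """Momentum p satisfying p * a * cos(theta) = n * pi * hbar, eq. (2)."""
    return n * math.pi * HBAR / (a * math.cos(theta))


def bragg_energy_ev(a, theta=0.0, n=1):
    """Free-electron kinetic energy p^2 / 2m at the Bragg condition, in eV."""
    p = bragg_momentum(a, theta, n)
    return p * p / (2.0 * M_E) / EV


a = 3.0e-10                               # assumed lattice spacing in metres
p = bragg_momentum(a)                     # normal incidence, first order
wavelength = 2.0 * math.pi * HBAR / p     # de Broglie relation
print(wavelength / a)                     # ratio of wavelength to spacing
print(bragg_energy_ev(a))                 # kinetic energy in eV
```

For normal incidence and $n=1$ the wavelength comes out as twice the spacing, as eq. (1) requires, and the corresponding energy is a few electron-volts, which is comparable with typical Fermi energies in metals.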


3. PERIODIC POTENTIALS IN ONE DIMENSION 


To simplify the discussion let us consider the case where the electron is 
travelling normal to a set of lattice planes. We then have, in effect, a one- 
dimensional problem, in which we are trying to propagate a wave through a 




linear array of objects, as in fig. 14. We need not concern ourselves in detail 
with the actual potential function for each such object; a set of square wells will 
do to simulate the obstacle presented by each lattice plane to the path of the 



Fig. 14. Periodic potential in one dimension. 


electron. The main point is that this potential repeats itself regularly in the x 
direction, with spacing a between the planes. 

To eliminate the constant $\hbar$ from the algebra let us write $p=\hbar k$, so that $k$
is the wave-vector, or, in our one-dimensional case, $k$ is the wave-number, of
the electron wave function. Thus, we start with a plane wave ‘travelling to the
right’

$$\psi_{k} = \exp(ikx). \qquad (3)$$

If the electron is moving very slowly, $k$ will be so small that the Bragg condition
will not be satisfied. But for

$$k = \pi/a \qquad (4)$$

the wave will be coherently reflected. This means that the wave function is
transformed into

$$\psi_{-k} = \exp(-ikx), \qquad (5)$$

which represents a wave travelling to the left. (The time factor in these waves
is always assumed to be $\exp(-i\mathcal{E}t/\hbar)$, but we need not include it explicitly in
the formulae.) We cannot avoid a mixture of such terms in the final wave
function. Well then, consider uniform mixtures like

$$\psi_{\pm} = \frac{1}{\sqrt{2}}\{\exp(ikx) \pm \exp(-ikx)\} = \begin{cases}\sqrt{2}\cos(\pi x/a)\\ i\sqrt{2}\sin(\pi x/a)\end{cases} \qquad (6)$$

Since the scattering is symmetrical, to right or to left, $\psi_+$ and $\psi_-$ will remain
functionally unchanged by Bragg reflections. The effect of such scattering is
simply to interchange the two travelling waves in the sum or difference. For
this particular value of $k$, it looks as if the appropriate quantum-mechanical
wave function for an electron in a steady state is a standing wave.

Now we could, of course, always construct such standing waves for free
electrons by combining the two travelling waves as in (6). This is always
permissible, because for free electrons both $\psi_+$ and $\psi_-$ have the same energy.
But in our periodic potential these two standing wave functions have different
energies. We can prove this by actually calculating the average value of the
potential energy of the electron in each state:

$$\bar{V}_{\pm} = \frac{1}{L}\int |\psi_{\pm}|^{2}\,V(x)\,dx = \frac{2}{L}\int \begin{Bmatrix}\cos^{2}(\pi x/a)\\ \sin^{2}(\pi x/a)\end{Bmatrix} V(x)\,dx. \qquad (7)$$


The integration is over the length, L, of the ‘ crystal ’. 




It is obvious from the Bragg condition (4) that $|\psi_{\pm}|^2$ has the same periodicity
as the potential function V(x), so that these two integrals do not vanish. If we 
suppose that the mean value of V(x) itself is zero (this is purely a matter of defining 
the energy zero of our system), then the two integrals will be equal in magnitude 
but opposite in sign. If, as in fig. 15(a), the peaks of electron density coincide 


Fig. 15. (a) Wave-function with high electron-density in the potential wells. (b) Wave-
function with high electron-density between ‘atoms’.


with the centres of the potential wells the value of the integral will be negative; 
in the other case, where the electron density is zero at the centre of each well, the 
potential energy will be positive. In fact 


$$\bar{V}_{\pm} = \mp|V_g|, \qquad (8)$$
where $V_g$ is the Fourier component of the periodic potential at its fundamental
wave number
$$g = 2\pi/a. \qquad (9)$$
The two states $\psi_+$ and $\psi_-$ will have the same kinetic energy as for free electrons:
$$\mathcal{E}^0_k = \hbar^2k^2/2m. \qquad (10)$$

Thus, the total energy of these two states is given by
$$\mathcal{E}_{\pm} = \hbar^2k^2/2m \mp |V_g|. \qquad (11)$$


In other words, the state $\psi_+$ lies significantly lower in energy than a single free-
electron state. It is therefore much more acceptable as a wave function for the
electron.† But this state is a standing wave, so that the electron is effectively
bound to the lattice, and is unable to travel as a wave packet throughout the 
crystal. 
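
The sign argument for the two standing waves can be checked numerically. The sketch below is only an illustration under stated assumptions: instead of square wells it uses a cosine potential $V(x)=2V_g\cos(gx)$ with zero mean and an assumed negative $V_g$ (wells centred on the atoms), and it evaluates the averages of eq. (7) by a simple midpoint rule. All names and numerical values are hypothetical.

```python
import math


def average_potential(density, potential, length, steps=20000):
    """Midpoint-rule estimate of (1/L) * integral of density(x) * potential(x) dx."""
    h = length / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += density(x) * potential(x)
    return total * h / length


a = 1.0                  # lattice spacing, arbitrary units
g = 2.0 * math.pi / a    # fundamental wave number, eq. (9)
V_g = -0.3               # assumed Fourier component; negative means wells at the atoms
L = 10 * a               # a 'crystal' of ten cells


def V(x):
    # Illustrative zero-mean potential, not the square wells of fig. 14
    return 2.0 * V_g * math.cos(g * x)


def dens_plus(x):
    return 2.0 * math.cos(math.pi * x / a) ** 2    # |psi_+|^2 from eq. (6)


def dens_minus(x):
    return 2.0 * math.sin(math.pi * x / a) ** 2    # |psi_-|^2 from eq. (6)


V_plus = average_potential(dens_plus, V, L)
V_minus = average_potential(dens_minus, V, L)
print(V_plus, V_minus)   # close to -|V_g| and +|V_g| respectively
```

With these assumptions the state that piles charge into the wells is lowered by $|V_g|$ and the other is raised by the same amount, in agreement with eq. (8).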

The result that we have just derived refers only to waves which exactly
satisfy the Bragg condition, in other words, where $k=\pi/a=\tfrac12 g$. It is scarcely
to be expected, however, that other waves will proceed through the lattice unmodified.
The lattice is itself rather like a wave, of wavelength $a$, i.e. of wave
number $g$. An electron wave tends to ‘interfere’ with this built-in potential
wave, so that the state with wave-number $k$ tends to be transformed into a wave
of wave-number $k+g$ or $k-g$, etc. This is just what happens in Bragg reflection,
where $k=\tfrac12 g$, and the reflected wave corresponds to $k-g=\tfrac12 g-g=-\tfrac12 g=-k$.

This suggests that the proper solution to the problem in the general case is
not a travelling wave $\exp(ikx)$, but a linear combination of this with all possible
‘reflected’ waves, i.e.

$$\psi_k = \alpha\exp(ikx) + \beta\exp\{i(k-g)x\} + \gamma\exp\{i(k+g)x\} + \dots \qquad (12)$$

where the coefficients are to be determined. This is a generalization of (6),
but we can no longer suppose that $\alpha$ and $\beta$ have the same magnitude. This is
because the ‘reflected’ wave, with momentum $(k-g)\hbar$, does not now have exactly
the same kinetic energy as the original incident wave, so that it can only be present
‘virtually’ in the wave function. It must quickly be transformed, by a second
reflection, back into the original wave, and will not therefore be present, on the
average, with large amplitude.

† Of course $\psi_+$ and $\psi_-$ are not exact solutions to the Schrödinger equation in a
potential such as fig. 15. One can easily construct such solutions in a series of square wells
by joining together pieces of sinusoidal wave at each discontinuity of $V$. But (11) is a good
approximation to the energy so long as $|V_g|$ is not too large, and the exact solution is still a
standing wave.


As an approximation, let us take only the terms in $k$ and $k-g$ in this sum, for
these are the two which are most nearly equal in kinetic energy. We can easily
calculate the total energy in such a state. The kinetic energy is just

$$\alpha^2\frac{\hbar^2k^2}{2m} + \beta^2\frac{\hbar^2(k-g)^2}{2m} = \alpha^2\mathcal{E}^0_k + \beta^2\mathcal{E}^0_{k-g} \qquad (13)$$

since $\alpha^2$ and $\beta^2$ represent the relative probabilities of finding the electron in
these partial states. The potential energy comes out in the same way as in (7).
Thus

$$\frac{1}{L}\int|\psi_k|^2\,V(x)\,dx = (\alpha^2+\beta^2)\frac{1}{L}\int V(x)\,dx + \alpha\beta\,\frac{1}{L}\int\{\exp(igx)+\exp(-igx)\}\,V(x)\,dx = -2\alpha\beta|V_g| \qquad (14)$$

because the average value of $V(x)$ is defined to be zero and the second integral
is again just the principal Fourier component of the periodic potential.

We add together (13) and (14), and then choose the ratio $\alpha/\beta$ to make this
total energy a minimum. The result is, by elementary algebra,

$$\mathcal{E}_k = \tfrac12(\mathcal{E}^0_k+\mathcal{E}^0_{k-g}) - \tfrac12\sqrt{(\mathcal{E}^0_k-\mathcal{E}^0_{k-g})^2+4|V_g|^2}. \qquad (15)$$

This state has lower energy than the free wave and is therefore a more acceptable
wave function for the electron in the lattice. Again it is not an exact formula
but has all the important features of the exact solution.
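
For readers who want the ‘elementary algebra’ spelled out, one possible route is sketched here. It assumes real coefficients normalized so that $\alpha^2+\beta^2=1$ and uses a Lagrange multiplier, written $\mathcal{E}$, for that constraint; this is a reconstruction of a standard argument and not necessarily the one the author had in mind.

```latex
% Total energy to be minimized, from (13) and (14), subject to \alpha^2+\beta^2=1:
W(\alpha,\beta) = \alpha^2\mathcal{E}^0_k + \beta^2\mathcal{E}^0_{k-g} - 2\alpha\beta|V_g| .

% Stationarity of W - \mathcal{E}(\alpha^2+\beta^2) with respect to \alpha and \beta:
(\mathcal{E}^0_k - \mathcal{E})\,\alpha - |V_g|\,\beta = 0 , \qquad
-|V_g|\,\alpha + (\mathcal{E}^0_{k-g} - \mathcal{E})\,\beta = 0 .

% A non-trivial solution requires the determinant to vanish:
(\mathcal{E}^0_k - \mathcal{E})(\mathcal{E}^0_{k-g} - \mathcal{E}) = |V_g|^2 .

% The two roots of this quadratic are
\mathcal{E} = \tfrac12(\mathcal{E}^0_k+\mathcal{E}^0_{k-g})
  \pm \tfrac12\sqrt{(\mathcal{E}^0_k-\mathcal{E}^0_{k-g})^2 + 4|V_g|^2} ,

% and the lower root is the minimum, with mixing ratio
\beta/\alpha = (\mathcal{E}^0_k - \mathcal{E})/|V_g| .
```

Multiplying the first stationarity condition by $\alpha$, the second by $\beta$, and adding shows that the multiplier $\mathcal{E}$ equals the total energy itself, so the lower root is the minimum energy of the two-wave mixture.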




Fig. 16. Energy as a function of wave number for electrons in a one-dimensional 
periodic potential. 




4. BANDS AND GAPS


To show the significance of this formula, let us plot $\mathcal{E}$ as a function of $k$. For
reference, we first sketch in the free-electron parabola $\mathcal{E}^0_k=\hbar^2k^2/2m$. Then we
draw vertical lines at $k=\pm\tfrac12 g$, where Bragg reflections occur. The energy here
is depressed below the free electron parabola by the amount $|V_g|$. If we
evaluate (15) for other values of $k$, we find a function that starts almost exactly
on the parabola at $k=0$ but draws away from it smoothly until it reaches
$\mathcal{E}^0_k-|V_g|$ at $\pm\tfrac12 g$, the points we have just plotted. If we were to look at the
ratio $\beta/\alpha$ for different values of $k$, we should find that it was nearly zero near
$k=0$ (i.e. the wave function is nearly a single travelling wave $\exp(ikx)$) and
that this ratio increases to unity at $k=\tfrac12 g$: the reflected wave becomes more
and more important until finally the travelling wave is brought to rest. We
shall show later that this is no metaphor; as we approach Bragg reflection, the
velocity of the electron falls to zero.
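
The behaviour just described can be tabulated with a few lines of code. This is a minimal sketch, assuming units in which $\hbar^2/2m=1$ and $a=1$, an arbitrary illustrative value for $|V_g|$, and the mixing ratio $\beta/\alpha=(\mathcal{E}^0_k-\mathcal{E}_k)/|V_g|$ that follows from minimizing (13) plus (14); the function names are hypothetical.

```python
import math

A = 1.0                    # lattice spacing, in units with hbar^2 / 2m = 1
G = 2.0 * math.pi / A      # fundamental wave number g
V_G = 0.5                  # assumed magnitude |V_g|, illustrative only


def free_energy(k):
    """Free-electron parabola in the chosen units."""
    return k * k


def lower_band(k):
    """Eq. (15): lower energy of the mixture of exp(ikx) and exp(i(k-g)x)."""
    e1, e2 = free_energy(k), free_energy(k - G)
    return 0.5 * (e1 + e2) - 0.5 * math.sqrt((e1 - e2) ** 2 + 4.0 * V_G ** 2)


def mixing_ratio(k):
    """beta/alpha for the minimum-energy mixture (assumed form, see lead-in)."""
    return (free_energy(k) - lower_band(k)) / V_G


def slope(k, dk=1e-5):
    """dE/dk by central difference; proportional to the wave-packet velocity."""
    return (lower_band(k + dk) - lower_band(k - dk)) / (2.0 * dk)


for k in (0.0, 0.25 * G, 0.4 * G, 0.5 * G):
    print(k, lower_band(k), mixing_ratio(k), slope(k))
```

Running the loop shows the ratio rising from nearly zero at $k=0$ to unity at $k=\tfrac12 g$, while the slope of the curve, which starts close to the free-electron value, drops to zero at the zone boundary.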

Nevertheless, there is no ‘ scattering’ of the wave. If we start at the left 
hand end of the lattice with a wave which is the proper mixture of $\exp(ikx)$ and
exp {i(k —g)x}, but which corresponds, nevertheless, to a net flux of electrons to 
the right, then this wave is not attenuated by passing through any number of cells 
of the lattice. We shall show later that the velocity of propagation of wave 
packets along the lattice is not as high as it is for free electrons, and may even 
vanish for special values of k, but there is no question of the flux diminishing 
with distance as we go along the crystal, and no sign of a wave packet being 
reflected back, as it would be by an isolated square well. This gives us an answer 
to one of our basic difficulties: we have found that there exist, in a regular periodic 
lattice, many solutions of the Schrödinger equation which are uniformly propagated
without scattering or attenuation, and which can therefore carry an electric current
without showing electrical resistance. To that extent, the conduction electrons
do not ‘see’ a perfectly regular lattice. 

But now let us look at equation (15) from the point of view of energy, and ask
what the electron state is for a particular value of $\mathcal{E}$. Near $\mathcal{E}=0$, these are free-
electron states. As the energy increases, the wave functions become more and
more distorted, until, at $k=\tfrac12 g$, we have only a standing wave. Now what
happens for higher values of $\mathcal{E}$? There is no solution of (15) in this neighbourhood.
The fact is that, for a whole range of energy there is no solution of the
Schrödinger equation. The next value above the level A at which a solution is
allowed is at the level B, which corresponds, in fact, to the state $\psi_-$ of (6). This is
also a standing wave, now exactly out of phase with the lattice, and therefore of
energy $+|V_g|$ above the free electron parabola. From here on, a continuous
range of energy is allowed, corresponding to the function (15) with a positive
sign before the square root. If we take these to be labelled by ever-increasing
values of $k$, we soon find that we are back near the free-electron parabola, and,
indeed back to nearly free-electron wave functions (fig. 16).
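
The two branches and the gap between levels A and B can be evaluated in the same way. The sketch below again assumes units with $\hbar^2/2m=1$ and $a=1$ and an illustrative $|V_g|$; it keeps only the two waves of the approximation above, so it should not be trusted near $k=g$, where a further Bragg reflection would mix in another wave.

```python
import math

G = 2.0 * math.pi   # g for unit lattice spacing, units with hbar^2 / 2m = 1
V_G = 0.5           # assumed magnitude |V_g|, illustrative only


def bands(k):
    """Return (lower, upper): eq. (15) with the minus and the plus sign."""
    e1, e2 = k * k, (k - G) ** 2
    mean = 0.5 * (e1 + e2)
    half_root = 0.5 * math.sqrt((e1 - e2) ** 2 + 4.0 * V_G ** 2)
    return mean - half_root, mean + half_root


level_A, level_B = bands(G / 2.0)     # the two standing-wave energies
gap = level_B - level_A
print(level_A, level_B, gap)          # the gap equals 2 |V_g|

# Above the gap the upper branch returns towards the free-electron parabola.
excess = [bands(k)[1] - k * k for k in (0.6 * G, 0.7 * G, 0.8 * G)]
print(excess)
```

The excess of the upper branch over the parabola shrinks steadily as $k$ moves away from $\tfrac12 g$, which is the return to nearly free-electron behaviour described in the text.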


The effect of a Bragg reflection is thus perfectly simple. The relation 
between energy and momentum is no longer a continuous function. At the 
point where Bragg reflection would occur, the curve is split, with a jump discontinuity
of magnitude $2|V_g|$. States of wave number less than $\tfrac12 g$ are lowered
in energy; those of greater wave number are raised. An energy gap is created
within which there are no solutions of the Schrödinger equation. If we look




in detail at what happens when we try to construct such states, we find that 
‘$k$ must become an imaginary number’. This is tantamount to saying that
these solutions are real exponential functions, which will grow or decay so rapidly 
with distance that they cannot be normalized or stabilized except near the ends 
of our linear array of atoms. These are the ‘ Tamm states’ or ‘ surface states ’. 
They play a role in some semiconductor phenomena but are irrelevant to the 
properties of bulk metals. 
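
To see what an ‘imaginary’ wave number looks like in practice, one can push the two-wave approximation into the gap. The sketch below is an extrapolation under stated assumptions, not a calculation from the original: it writes $k=\tfrac12 g+iq$, keeps the secular condition $(k^2-\mathcal{E})\{(k-g)^2-\mathcal{E}\}=|V_g|^2$ in units with $\hbar^2/2m=1$ and $a=1$, and solves for the decay constant $q$. Names and values are hypothetical.

```python
import math

G = 2.0 * math.pi        # g for unit lattice spacing, units with hbar^2 / 2m = 1
V_G = 0.5                # assumed magnitude |V_g|, illustrative only
E_MID = (G / 2.0) ** 2   # free-electron energy at k = g/2, the centre of the gap


def decay_constant(energy):
    """Imaginary part q of k = g/2 + i*q for an energy inside the gap."""
    e = energy - E_MID                 # energy measured from the gap centre
    if abs(e) > V_G:
        raise ValueError("energy lies in an allowed band; k is real there")
    b = 2.0 * e + G * G
    # u = q^2 is the positive root of u^2 + b*u + (e^2 - V_G^2) = 0
    u = (-b + math.sqrt(b * b - 4.0 * (e * e - V_G ** 2))) / 2.0
    return math.sqrt(max(u, 0.0))


def secular(k, energy):
    """Residual of the two-wave secular condition for a (complex) k."""
    return (k * k - energy) * ((k - G) ** 2 - energy) - V_G ** 2


q_mid = decay_constant(E_MID)
print(q_mid, 1.0 / q_mid)              # decay constant and decay length in units of a
print(abs(secular(complex(G / 2.0, q_mid), E_MID)))   # should be negligibly small
```

In this model the decay constant is largest at the centre of the gap and shrinks towards the band edges, where the solutions go over into the standing waves at levels A and B.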

This theory of wave propagation in a periodic one-dimensional array is not 
unique to metals. Readers may well be familiar with the ‘ pass bands’ and 
‘stop bands’ of frequency in the propagation of alternating currents through an
array of electrical filters. The argument is precisely the same: for ‘frequency’,
now read ‘energy’. The elegant book by Brillouin† brings out the analogy
to the full. 


5. BRILLOUIN ZONES 


In § 2 we started out to study wave-propagation in a three-dimensional array 
of atoms, but found the problem a little too ambitious. We simply remarked 
that free electrons would suffer Bragg reflection if their de Broglie wavelength 
were equal to twice the projection of the spacing of a set of lattice planes on their 
propagation vector. But in § 3 and § 4 it was shown that the analogue of Bragg 
reflection in a one-dimensional lattice gave rise to a gap in the energy spectrum. 
Suppose now that we consider propagation in various directions, with various 
values of momentum, through a set of lattice planes—for example a set of planes 
spaced a apart, normal to the x axis (fig. 17). Let us use a wave vector $\mathbf{k} = \mathbf{p}/\hbar$
to specify the electron states, thus avoiding the word ‘momentum’ (which
sounds too ‘dynamical’ in the present context) and also an extraneous factor $\hbar$.
The condition for the ‘first’ Bragg reflection now reads


$$k_x = \pm\pi/a. \qquad (16)$$


Fig. 17. (a) Propagation through a set of lattice planes. (b) Bragg reflection occurs
when the k-vector lies on one of the planes $k_x = \pm\pi/a$.


But now in ‘momentum space’, or, what is now more appropriate, ‘k-space’,
this relation has a simple geometrical significance; the tip of the vector k lies on
a plane normal to the x-axis, at a distance $+\pi/a$ or $-\pi/a$ from k = 0. An
electron whose wave-vector reaches this plane will be Bragg reflected by the
crystal lattice. In one dimension the analogues of these planes are the points
$k = \pm\pi/a$ on the k-axis, and at these points we found an energy gap, bounded
above and below by standing-wave states.


† L. Brillouin, Wave Propagation in Periodic Structures (New York: McGraw-Hill).

This suggests that the planes defined by (16) are the locus of an energy
discontinuity in k-space. That is to say, when we do, with great labour, set down
and solve the Schrödinger equation for an electron in our lattice of ions, and find
a function, $\mathcal{E}(\mathbf{k})$, which measures the energy in terms of the wave vector of the
state, this function will be continuous from k = 0 up to these planes, and then
will jump to a higher value. We can almost prove this directly by thinking that
this set of lattice planes will scarcely affect wave propagation normal to the
x-direction, so that we can choose our y- and z-components of k, and write down
the corresponding kinetic energy terms for plane waves, independently of $k_x$,
and then we can think of the problem of constructing a suitable factor in the
wave function for propagating in the x-direction just as if we were dealing with
a one-dimensional system.

However, an ordinary simple cubic lattice has many different sets of lattice 
planes. The ones that naturally suggest themselves are the cube planes, spaced 
a apart along the x-, y- and z-axes. But now in k-space there will be three sets
of discontinuity planes, forming a cubical box as in fig. 18. We call such a box
a Brillouin zone, or simply zone. The planes of discontinuity of energy are the
zone boundaries.


Fig. 18. (a) Propagation through a square array. (b) The Brillouin zone for a square array.
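
The geometry of the cubical zone is easily put into a few lines. The sketch below assumes a simple cubic lattice of spacing a and considers only the cube planes; the function names and the tolerance `tol` are hypothetical, introduced for illustration.

```python
import math

def in_first_zone(k, a):
    """True if the wave vector k = (kx, ky, kz) lies inside or on the cubical zone."""
    return all(abs(ki) <= math.pi / a for ki in k)

def bragg_planes_hit(k, a, tol=1e-9):
    """Axes whose boundary plane k_i = +pi/a or -pi/a the tip of k touches."""
    return [axis for axis, ki in zip("xyz", k)
            if abs(abs(ki) - math.pi / a) < tol]

a = 1.0                                                 # lattice spacing (arbitrary units)
inside = in_first_zone((0.5, -1.0, 3.0), a)             # all components within pi/a
outside = in_first_zone((3.2, 0.0, 0.0), a)             # kx beyond pi/a
planes = bragg_planes_hit((math.pi, 0.0, -math.pi), a)  # tip lies on two faces
```

A vector whose tip lies on one of these faces satisfies (16) for the corresponding set of cube planes, and so is Bragg reflected by them.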

Here at last we are getting somewhere near the heart of the modern theory 
of metals. Let me repeat what we have done. We propose to study electron 
wave-functions in a periodic lattice. These are non-local wave functions spread 
through the whole crystal, so that each can have a well defined momentum or 
k-vector. We therefore represent each state by a point in momentum space or 
‘k-space’. Because of coherent diffraction of the electrons by the regular atomic
planes of the crystal, some states are very strongly affected by the periodic
potential, and are very different from free-electron waves. The states which are 
most strongly diffracted, and whose energy is changed the most (up or down) 
are those whose k-vectors correspond to points on a set of planes which delineate 
a cell in k-space. This cell is a Brillouin zone. 


6. METAL OR SEMICONDUCTOR 


Brillouin zones have another property that is rather important. In I § 7 we
considered the ‘allowed momentum states’ of a free-electron gas in a large box,
so that we might count the number of states and find their energy. It was shown
that the components of the momentum must be integral multiples of $2\pi\hbar/La$,
where La is the length of the side of the big box representing the whole metal
specimen. In terms of the wave vector


$$(k_x, k_y, k_z) = \left( s_x \frac{2\pi}{La},\; s_y \frac{2\pi}{La},\; s_z \frac{2\pi}{La} \right), \qquad (17)$$


where $s_x$, $s_y$, $s_z$ are integers.


But, of course, L is a large number, so that this distribution of ‘allowed states’
in k-space is very fine-grained; for most practical purposes we take it as
continuous. This set of rules will still apply when we treat a real metal, with a periodic
lattice, in which the electrons are no longer quite free. Dynamical momentum
itself is not a good quantum number, because of the interaction of the electrons
with the lattice, but the vector k, which has many analogous properties, can
be shown to be quantized according to the rule (17). (Some people refer to $\hbar\mathbf{k}$
as the crystal momentum.) This only means that we have to fit integral numbers
of wavelengths into the box, to satisfy the boundary conditions.

Now suppose that we have a simple cubic lattice, in which a is the lattice
spacing. If we compare (16) and (17), we see that there is a close connection
between the Brillouin zone boundary and the allowed states. The zone
boundaries occur when the integers $s_x$, $s_y$, $s_z$ have the values $\pm\tfrac{1}{2}L$. The zone is thus
divided into a fine mesh of ‘momentum cells’, each corresponding to an allowed
state (fig. 19). This is the same sort of picture as fig. 9 of I.


Fig. 19. ‘Allowed’ momentum states in a Brillouin zone.


The important point is that our cubical zone contains almost exactly $L^3$ such
‘allowed states’. Between $-\tfrac{1}{2}L$ and $\tfrac{1}{2}L$, we have at our disposal L values of
$s_x$, L values of $s_y$, and L values of $s_z$. In other words, a Brillouin zone in k-space
contains as many ‘allowed momentum states’ as there are unit cells in the
crystal lattice.

It is important to grasp this clearly, as one may easily be muddled by the
two different types of construction in k-space. The Brillouin zone is an
invariant geometrical object. It is a polyhedron whose shape and dimensions
depend only on the orientation and spacing of the lattice planes of the material.
It is the same whatever the size and shape of the lump of material that is being
studied. But the dimensions of the lump, and the boundary conditions on its
surface, determine the fineness and detailed arrangement of the mesh-points
of allowed states in k-space. This might be awkward; we might have to specify
the size of our specimen in each experiment if it were not for two simple rules:
the distribution of mesh points is always of uniform density in k-space, and the
Brillouin zone always contains exactly as many points as there are unit cells in
the specimen. From these rules we can enumerate states very simply. If
our specimen contains N unit cells then we know that a fraction $\alpha$ of the volume
of the Brillouin zone in k-space will contain $\alpha N$ allowed states of momentum.
With two alternative states of spin, this volume could contain $2\alpha N$ electrons. In
other words, a fraction $\alpha$ of the volume of a Brillouin zone can accommodate
the electrons that would be in the solid if there were, on the average, $2\alpha$ electrons
per unit cell.
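
The two counting rules can be verified by brute force on a small mesh. The sketch below is only an illustration: it assumes a cubical specimen with an even number L of cells along each edge, so that N = L³, and the helper names `allowed_integers` and `count_states` are hypothetical.

```python
def allowed_integers(L):
    """The L integers s with -L/2 < s <= L/2 (L assumed even)."""
    return range(-(L // 2) + 1, L // 2 + 1)

def count_states(L, keep=lambda sx, sy, sz: True):
    """Count mesh points (sx, sy, sz) of rule (17) in the zone that satisfy `keep`."""
    ints = allowed_integers(L)
    return sum(1 for sx in ints for sy in ints for sz in ints
               if keep(sx, sy, sz))

L = 10                                   # cells along each edge (hypothetical)
N = L ** 3                               # unit cells in the specimen
whole_zone = count_states(L)             # every allowed state in the zone
half_zone = count_states(L, lambda sx, sy, sz: sx <= 0)   # the half with alpha = 1/2
electrons = 2 * half_zone                # two spin states per allowed k
```

With α = 1/2 the half zone holds αN states and 2αN = N electrons, that is, one electron per unit cell.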

This suggests an interesting possibility. Suppose we have a divalent 
element which crystallizes into a lattice with one atom per unit cell. Then there 
will be exactly 2 electrons per unit cell—just enough to fill a whole zone right to 
the corners. This was the sort of arrangement we looked at in I, fig. 7, and
found to be unstable. Electrons would ‘ flow’ from the corners to the centres 
of the faces, until we had a sphere in k-space. But, if there were big energy gaps 
to overcome at these zone faces, this process would not seem possible, and an 
exactly full zone would be achieved. The electrons in this zone would fall into 
a full energy band, with a gap above it. 

A full band is electronically inert—rather like a closed shell of electrons in an 
atom. The element would not be a metal, but a semiconductor. Any electrical
conductivity could be traced to minor deviations from the perfect closed-band 
state. A few extra electrons may be introduced along with chemical impurities, 
or there may be thermal excitation of a few electrons above the energy gap into 
the next band, where they are more or less free to carry electrical currents. By 
comparison with a metal, the electrical conductivity would be small, and very 
sensitive to changes of temperature and to traces of dirt. If the energy gap is 
very large (diamond is an example), the solid may even be effectively an insulator. 
In such a case, we might begin to wonder whether localized states for the
electrons, as discussed in I § 7, might not be more appropriate. The ‘energy
gap’ may then be thought of as an ‘ionization energy’ or ‘electron affinity’ for
the release of an electron from the atom or ion at an individual lattice site.

As it happens, all Group II elements are metals: we must go to Group IV of 
the periodic table, to carbon, silicon and germanium, to find a semiconductor. 
The argument that I have just presented assumes that the bands do not overlap.


† Strictly speaking, (L + 1), if we include the two end points; the difference is
insignificant because L is very large.


This is impossible in one dimension. But in three dimensions we have to study
the whole zone. Although there will be an energy gap when we cross the zone 
boundary at any point, this gap does not come at the same absolute energy at all 
points on the boundary. It may be (fig. 20) that the top of the gap at some point 
A lies actually below the bottom of the gap at some point B. Then, as we add 
electrons, they will ‘ spill over ’ into the upper band of states around A, instead of 
filling right up into the corner at B. This means that the lower band is not 


Fig. 20. (a) Energy contours, and a Fermi surface for a divalent metal. (b) Cross-sections
of the Brillouin zone along two directions, showing overlap of bands.


quite full, and the upper band not quite empty. The electrons in these bands 
are no longer energetically isolated under energy gaps, and can easily be made to 
carry electric currents etc. 


7. THE FERMI SURFACE


The essence of the argument of this chapter is that the periodic lattice has the 
effect of carving the energy distribution of the conduction electrons into a series 
of bands. Each band can contain two electrons per unit cell. From this alone 
we can argue that an element such as Cu, with one electron per atom, one atom 
per unit cell, must be a metal, for there simply are not enough electrons to fill a 
band. The same goes for Al, with the same crystal structure and three electrons 
per atom. But Mg with two electrons per atom, and two electrons per unit cell, 
could be a semiconductor—if it were not for large overlaps across the zone 
boundaries. Some of the tetravalent elements are metals, some are semi- 
conductors, because in some cases the bands overlap whilst in others they are 
separated by a substantial energy gap. The case of As, Sb and Bi is interesting.
With 5 electrons per atom, we might anticipate simple metals. But they crystallize
in a structure with two atoms per unit cell, so there are really 10 electrons to
accommodate. As it happens, the 5 bands are nearly all full, but there is a little
overlap (in Bi, about 1/1000 of an electron per atom!) into a 6th band. These
elements are such poor conductors that they are called ‘semi-metals’.
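
The counting argument of this paragraph can be stated as a short rule of thumb. The sketch below encodes only what the text says: each band holds two electrons per unit cell, an odd count forces a partly filled band, and an even count leaves the question to band overlap. The function name and the wording of the verdicts are hypothetical.

```python
def band_filling(valence, atoms_per_cell):
    """Return (electrons per cell, bands they would fill, verdict from counting alone)."""
    electrons = valence * atoms_per_cell
    bands = electrons / 2                # each band holds two electrons per unit cell
    if electrons % 2 == 1:
        verdict = "metal: a band must be partly filled"
    else:
        verdict = "metal or semiconductor, depending on band overlap"
    return electrons, bands, verdict

copper = band_filling(1, 1)      # one electron per atom, one atom per cell
aluminium = band_filling(3, 1)   # three electrons per atom, one atom per cell
bismuth = band_filling(5, 2)     # five electrons per atom, two atoms per cell
```

Counting alone settles the odd cases (Cu, Al); for the even cases, such as the 10 electrons per cell of As, Sb and Bi, it says only that full bands are possible, and the overlap decides.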

It is also interesting to notice what happens when we melt a solid. The 
crystal lattice disappears and the sharp Bragg reflections are smudged out into
one or two broad peaks. The Brillouin zone boundaries lose their meaning,
and the energy gaps more or less disappear. Thus, the bands coalesce, and we
have an electron distribution which is effectively the same as in a free-electron
gas. We find that semi-metals and semiconductors become metallic when they
melt, so that liquid Pb and liquid Ge have nearly the same electrical properties.

One can evidently give a good qualitative account of the properties of metals 
from an analysis of their band structure. One can even go further, and construct 
curves representing the distribution of electrons in energy in different bands,
the function being the density of states, defined such that $\mathcal{N}(\mathcal{E})\,d\mathcal{E}$ is the number
of electron states in the range of energy $\mathcal{E}$ to $\mathcal{E} + d\mathcal{E}$ (fig. 21). If $\mathcal{N}(\mathcal{E})$ is
small at the Fermi level, then we deduce that the metal is not a good conductor,


Fig. 21. A theoretical estimate of the density of states in the d-band of
a face-centred-cubic metal.


and so on. But for a quantitative account the band picture is inadequate. It 
does not allow for the differences between the properties of different electron 
states at the Fermi level itself. In fig. 20, for example, the states near the corners 
of the zone are entirely different from those that have been filled by overlap in 
the second band. In more complicated cases, electrons that are nominally in 
the same band may not be equivalent, and must be treated separately in
calculating the macroscopic properties.


But we are still only interested in states near the Fermi level, and these all 
lie on or near a surface of constant energy in k-space. This surface, defined 
by the relation 

$$\mathcal{E}(\mathbf{k}) = \mathcal{E}_F, \qquad (18)$$
automatically provides for us all the electron states that can play any part in the 
ordinary transport properties of the metal, or in such thermal properties as the 
specific heat. It is called the Fermi Surface. The name has been traced back to
a paper by Bardeen, in 1940, but the concept can be found in much earlier papers, 
as long ago as 1930. From a mathematical point of view, k-space is the natural 
medium for the classification of electronic states, and the construction of surfaces 
of constant energy is then irresistible. We automatically talk of ‘ Electrons on 
the Fermi surface ’, instead of ‘ electrons at the Fermi level’, and when we cool 
our metal to absolute zero we say that ‘all states inside the Fermi surface are 
full; all those outside it are empty ’. 


In a free electron gas, of course, the Fermi surface is a sphere. This is
essentially the construction of I, fig. 9. The geometry of a sphere is so simple
that we need scarcely use a concrete model of this sort, and can calculate the
electronic properties explicitly by analytical methods.† But in a real metal the
Fermi surface is almost sure to be different from a sphere. Even in a monovalent
metal the energy in certain directions in k-space may be so much reduced by the 
energy gaps at the nearest zone boundaries that the surface is drawn into contact 
with the zone faces. If there are two or more electrons per atom, the influence of 
the zone boundaries is sure to be profound, for they will certainly intersect any 
free-electron sphere that we draw, and thus will introduce energy gaps at every 
line of intersection. 
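
The remark about two or more electrons can be illustrated for the cubical zone of § 5. The sketch below assumes a simple cubic lattice with one atom per cell, which is not the structure of most real metals, and uses the rule that the whole zone, of volume (2π/a)³, holds two electrons per cell. The function names are hypothetical.

```python
import math

def fermi_radius(electrons_per_cell, a):
    """Radius of the free-electron sphere for a simple cubic lattice of spacing a."""
    # The zone of volume (2*pi/a)**3 holds 2 electrons per cell, so a sphere
    # holding n electrons per cell has k_F**3 = 3 * pi**2 * n / a**3.
    return (3.0 * math.pi ** 2 * electrons_per_cell) ** (1.0 / 3.0) / a

def sphere_crosses_zone_face(electrons_per_cell, a=1.0):
    """True if the sphere reaches beyond the nearest zone face at pi/a."""
    return fermi_radius(electrons_per_cell, a) > math.pi / a

one_electron = sphere_crosses_zone_face(1)   # sphere stays just inside the faces
two_electrons = sphere_crosses_zone_face(2)  # sphere cuts through the faces
```

On this model the monovalent sphere falls a little short of the zone faces, so that only a distortion by the energy gaps could draw it into contact, whereas with two electrons per cell the sphere is certain to be intersected.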

What this means is that the properties of the electron gas will not be isotropic. 
The dynamical properties of an electron at the Fermi level will depend on its 
direction of motion. If it is travelling in a direction where it might very easily 
suffer Bragg reflection by the lattice, then it may seem ‘slow’, and ‘ heavy’; 
in other directions it may behave very like a free electron. These dynamical 
properties just depend on ‘ where it is on the Fermi surface ’, so that in many cases 
the shape of the Fermi surface relative to the Brillouin zone is a guide to the 
electrical properties of the metal. 

This is obvious in the pre-war work. What is new, and exciting in this field 
is that the Fermi surface can be treated as if it were almost a solid object. It is as 
invariant as the Brillouin zone within which it is defined. By various elegant 
experiments we can trace out its shape with almost the same accuracy as we can 
survey a range of mountains. It is difficult to think of a case in the history of 
physics which better exemplifies the principle that ‘last year’s abstract 
mathematical construction is this year’s concrete model of reality ’. 


† Because of this analytical simplicity, the free electron model lasted too long. Even
when it was breaking down, attempts were made to save the algebra by ‘generalization’ to
ellipsoidal surfaces. It is now obvious that Fermi surfaces are usually much more
complicated than this; they may, for example, be multiply-connected.


Editor’s Note : 

Offprints of Dr. Ziman’s articles on Electrons in Metals will be bound together and 
published in booklet form on completion of the series. They will be available from 
the Publishers early in 1963. 


Physics Applied to Archaeology—Part I 


by M. J. AITKEN 


Research Laboratory for Archaeology and the History of Art, 
University of Oxford 


Magnetism is one of the oldest branches of Western science and the compass 
has long been used for navigation, certainly for the past thousand years†. It
is appropriate, therefore, that magnetic techniques should play an important 
part in the archaeological field, and this article will be mainly concerned with 
Magnetic Dating and Magnetic Location. 

The direction of the earth’s magnetic field in any given region varies by a 
degree or two every twenty years, and magnetic dating is possible from this 
because when the ancients fired pottery kilns, ovens, furnaces, etc., they un- 
knowingly recorded this field direction in the thermo-remanent magnetism 
acquired by baked clay that contains a few per cent of iron oxide. The same 
property makes it possible to locate buried kilns, etc., using the recently 
developed proton magnetometer to detect the slight magnetic disturbance 
created at ground level. Other types of archaeological remains are detectable, 
too, notably those in which chemical changes in soil associated with human 
occupation have resulted, somewhat fortuitously, in an enhanced magnetic 
susceptibility. 


1. MAGNETIC DATING 


It has been known for over a century that a rock, if heated to a dull red heat 
and allowed to cool, will acquire a weak permanent magnetism, the direction of 
which is the same as the lines of force of the earth’s magnetic field. The same 
phenomenon occurs in baked clay, and at the end of the nineteenth century an 
Italian scientist (Folgheraiter 1899) measured the direction of the remanent 
magnetism in some ancient vases which had been buried at Aretto for over two 
thousand years. He found that the remanent direction was independent of the 
orientation in which each vase was found, but that it made roughly the same 
angle with the base, suggesting, firstly, that the remanent magnetism acquired 
after baking was stable over thousands of years, and secondly, that the base of 
the vase had been horizontal during baking. Similarly, the remanent directions 
in the bricks of a Roman wall bear no relation to how they lie in the wall. 

The mechanism by which clay and rocks acquire this thermo-remanent 
magnetism will be outlined shortly. In practice it means that baked clay that 
has remained undisturbed since firing carries in it a record of the direction of 
the earth’s field at the time of firing. This direction is conveniently defined by 
the magnetic declination, D (the angle between magnetic north and true north), 
and the angle of dip, I (the inclination to the horizontal of the magnetic lines of
force). The secular variation of these angles over the past few hundred years
is known from records of direct observations made on magnetized needles (see 


† In the tenth century A.D., the laws of Gottland, an island in the Baltic, prescribed
penalties for sailors who breathed on the compass after eating onions!


fig. 1). Between A.D. 1600 and A.D. 1800 the magnetic declination in London
swings from 10° east of true north to 24° west. Consequently, within that
period, if the remanent direction can be found to within + 2°, the date of firing 
can be deduced, by reference to fig. 1, to within +12 years. This is a much 
higher degree of precision than can be achieved with radiocarbon dating, but 


[Plot for fig. 1: horizontal axis, declination (D), from 24° west to 24° east.]


Fig. 1. Secular Variation (for London). The solid curve (A.D. 1580 to present day) is 
based on records of direct observations on magnetized needles. The dotted curves 
represent tentative results obtained by measuring the direction of the remanent 
magnetism in samples of baked clay taken from pottery kilns of archaeologically- 
known date. 


on the other hand, a knowledge of the secular variation is a necessary
prerequisite. Prior to A.D. 1600 its behaviour has to be determined by
measurement of the thermo-remanent directions in clay samples of which the date of
firing is known from archaeological evidence. Such measurements dispel any 
hopes that the past shape of the secular variation curve is predictable; this was 
unlikely on other grounds too. Thus magnetic dating is useful primarily for 
filling in precise detail, in a period of which the broad chronological outline is 
known. Even when established, the secular variation curve is only useful over 
a region about five hundred miles across. 
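
The figure of ±12 years quoted above follows from simple arithmetic. The sketch below assumes, for illustration only, that the declination swung at a uniform rate between A.D. 1600 and A.D. 1800; the real curve of fig. 1 is not linear, and the dip also carries information. The function name is hypothetical.

```python
def dating_uncertainty(swing_deg, span_years, direction_error_deg):
    """Years of uncertainty for a given angular error, at a uniform rate of swing."""
    rate = swing_deg / span_years            # degrees of declination per year
    return direction_error_deg / rate

swing = 10.0 + 24.0                          # from 10 deg east to 24 deg west
uncertainty = dating_uncertainty(swing, 1800 - 1600, 2.0)   # close to 12 years
```

A swing of 34° in 200 years is 0.17° per year, so an error of ±2° in the remanent direction corresponds to roughly ±12 years.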


Thermo-remanent Magnetism 

The phenomenon of thermo-remanent magnetism has been exhaustively 
studied by Professor Thellier in Paris, and a theoretical interpretation has been 
given by Professor Néel of Grenoble (Thellier 1938, Thellier 1951, Thellier and 
Thellier 1959, Néel 1955). Most crude clays contain several per cent of iron oxide.
In red well-oxidized baked clay, this is in the form of haematite ($\alpha$-Fe$_2$O$_3$),
which is weakly ferrimagnetic. The size of the haematite grains is small
enough (perhaps $10^{-5}$ to $10^{-6}$ cm) for each grain to be considered as a single
magnetic domain, spontaneously magnetized along a direction dictated by the
shape, or crystalline anisotropy, of the grain. The intensity of the spontaneous
magnetization is unaffected by an applied magnetic field, but, if the temperature
is high enough, its direction can reverse, so that its component along the field
direction is parallel rather than anti-parallel (fig. 2). This reversal is possible


(a) Unbaked (b) Baked 


Fig. 2. Acquisition of Thermo-remanent Magnetism. The arrows represent domains 
in the ferrimagnetic iron oxide present as an impurity in the clay. Heating to 
several hundred degrees centigrade permits reversal of some domains so that they 
have a component in the same direction as the earth’s field (F). On cooling, the 
domains are ‘ frozen ’ again, but now there is a net magnetic moment in the direction 
of F. (Reproduced from Physics and Archaeology by courtesy of Interscience 
Publishers Ltd.) 


when the thermal-agitation energy of the grain is comparable with the potential 
energy of the constraints tending to keep the magnetization along the pre- 
ferred axis of the grain. The probability of reversal is a very sharply rising 
function of temperature, and it is possible to define a blocking temperature,
$T_B$, above which there is a high probability of reversal in a few seconds, and
below which reversal is only likely in a very long time. $T_B$ is proportional to
the volume of the grain, so that if there is a wide spread of grain sizes in a
specimen there will also be a wide spread of blocking temperatures. Thus
heating to only one or two hundred degrees centigrade will permit some grains 
to reverse their magnetization, and on cooling there will be a small remanent 
magnetism, though perhaps only 5 or 10 per cent of the saturation amount that 
would have developed had the baking temperature reached the Curie point 
(675 °C for haematite).
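
The sharpness of the blocking temperature, and its proportionality to grain volume, can be illustrated with a simple relaxation law. The article gives no formula, so the sketch below assumes the commonly quoted Arrhenius form, a relaxation time τ = τ₀ exp(KV/kT), with a temperature-independent barrier KV. The values of `k_aniso`, `volume`, `tau0` and the observation time `t_obs` are hypothetical, chosen only to give numbers of a sensible size.

```python
import math

K_B = 1.380649e-23          # Boltzmann constant, J/K

def relaxation_time(volume, temperature, k_aniso, tau0=1e-9):
    """Assumed Arrhenius relaxation time (seconds) for a single-domain grain."""
    return tau0 * math.exp(k_aniso * volume / (K_B * temperature))

def blocking_temperature(volume, k_aniso, t_obs=10.0, tau0=1e-9):
    """Temperature at which the relaxation time equals the observation time t_obs."""
    return k_aniso * volume / (K_B * math.log(t_obs / tau0))

k_aniso = 1.0e4                 # hypothetical anisotropy energy density, J/m^3
volume = (25e-9) ** 3           # hypothetical grain volume, m^3
t_block = blocking_temperature(volume, k_aniso)
```

On this assumed law, halving the absolute temperature below `t_block` lengthens the relaxation time from seconds to thousands of years, and doubling the grain volume doubles the blocking temperature.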

In grey reduced clay the iron oxide is in the form of magnetite (Fe$_3$O$_4$).
Although the grains in this case (10-4 to 10-? cm) are too big to contain only a 
single domain, the same thermo-remanent properties are exhibited. In both 
cases the magnetization acquired on cooling through a certain temperature 
interval is held by carriers whose blocking temperatures, $T_B$, lie within that
interval. These carriers retain their remanent magnetization until the temperature
again reaches $T_B$. Thus a pot which, subsequent to firing, is reheated to a
lower temperature T has two remanent components: that carried by grains having
$T_B > T$, which remains from the first firing, and that of grains having $T_B < T$,
acquired from the second heating. Depending on the orientations of
the pot in the two firings, these components may be in different
directions. A special case is when the two components are equal but in opposite
directions. The resultant remanent magnetic moment is then zero. If the
lower temperature component is now removed by heating to T and cooling in 
zero magnetic field, the specimen is left with a magnetization corresponding in 
direction to the first firing. Thus, by demagnetizing from successively 
increasing temperatures, it is sometimes possible to reconstruct the thermal 
history of a specimen. 
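
The special case of two equal and opposite components makes a convenient worked example. The sketch below is a deliberately crude model: each group of carriers is represented by a single blocking temperature and a moment vector, and demagnetization from a temperature T in zero field is taken to remove every carrier with blocking temperature below T. All names and numbers are hypothetical.

```python
def remanence(components, demag_temp=0.0):
    """Vector sum of the moments of carriers whose blocking temperature exceeds demag_temp.

    components: list of (blocking_temperature, (mx, my, mz)).
    """
    kept = [m for t_block, m in components if t_block > demag_temp]
    if not kept:
        return (0.0, 0.0, 0.0)
    return tuple(sum(axis) for axis in zip(*kept))

# First firing magnetizes the high-temperature carriers along +x; a second
# heating to 300 degrees re-magnetizes the low-temperature carriers along -x.
pot = [
    (500.0, (1.0, 0.0, 0.0)),    # carriers blocked at 500, from the first firing
    (200.0, (-1.0, 0.0, 0.0)),   # carriers blocked at 200, from the second heating
]
as_found = remanence(pot)                     # the two components cancel
after_cleaning = remanence(pot, 300.0)        # first-firing direction recovered
```

Repeating the last step at successively higher temperatures strips off the components in order of blocking temperature, which is the procedure described in the text.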


Measurement of Remanent Direction 


In order to find the ancient magnetic direction it is, of course, essential that 
the baked clay be found exactly in the position in which it cooled down. This 
condition is satisfied in pottery kilns, hearths, ovens and furnaces. Clay was 
used in the construction or as a lining because of its refractory properties. To 
determine the magnetic direction recorded in a kiln or other burnt structure, a 
dozen or so lumps of well-baked clay are extracted from the floor, or fairly low 
down the walls (in order to minimize the possibility of movement subsequent to 
cooling). Before each lump is detached, its orientation with respect to true 
north and to the horizontal must be marked on it. To do this, it is partially
enclosed in plaster, of which the top surface is made accurately horizontal. A
line is then sighted on to this surface from a theodolite, of which the azimuth
scale has been related to true north by means of a sun-shoot. The direction of


[Diagram labels: Fibre; N; Sample.]


Fig. 3. Astatic Magnetometer. Because the magnets are of equal moment, the net
restoring torque due to the earth’s field is zero. There is a deflection due to a
magnetized sample because it is placed nearer to the lower magnet than to the
upper one. (Reproduced from Physics and Archaeology by courtesy of Interscience
Publishers Ltd.)


the remanent magnetization in each sample is measured in the laboratory by 
means of a sensitive magnetometer and an average for the structure obtained. 
The direction differs slightly from sample to sample, not only because of errors 
in measurement and orientation-marking, but also because of distortion of the 
lines of force by the structure itself, and because of physical settling of 
the structure during the centuries. 
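
One plausible way of forming the average for a structure is to add the sample directions as unit vectors. The article does not say how the average was taken, so the sketch below is an assumption; it takes declination as positive eastwards and dip as positive downwards, and the function name is hypothetical.

```python
import math

def mean_direction(samples):
    """Average (declination, dip) pairs in degrees by summing unit vectors."""
    north = east = down = 0.0
    for dec, dip in samples:
        d, i = math.radians(dec), math.radians(dip)
        north += math.cos(i) * math.cos(d)   # horizontal component towards true north
        east += math.cos(i) * math.sin(d)    # horizontal component towards east
        down += math.sin(i)                  # vertical component, downwards
    mean_dec = math.degrees(math.atan2(east, north))
    mean_dip = math.degrees(math.atan2(down, math.hypot(north, east)))
    return mean_dec, mean_dip

# Two hypothetical samples scattered symmetrically about true north.
dec, dip = mean_direction([(10.0, 60.0), (-10.0, 60.0)])
```

Averaging vectors rather than angles avoids trouble when declinations straddle north, and it gives a slightly steeper mean dip than the plain average of the dips when the samples are scattered.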

The simplest form of instrument used to measure the remanent direction is 
the astatic magnetometer. This consists of two bar-magnets of identical 
strength, fixed rigidly at either end of a rod about 4 in long (fig. 3), and suspended 
by a fine fibre of phosphor-bronze or quartz. Since the two magnets are anti- 
parallel, the torque on the upper one due to the earth’s field exactly cancels out 
that on the lower one; on the other hand, the torque on the lower magnet due to a 
small sample held below the lower magnet at a distance equal to the separation 
of the two magnets (r = l in fig. 3) is eight times the opposing torque exerted on
the upper magnet. With careful design and construction (see, for example, 
Blackett 1952), the field due to samples having moments of only $10^{-7}$ emu can
be detected. While this sensitivity is more than adequate, the astatic
magnetometer has the drawback that, except for small samples, irregular shape or
non-uniform magnetization can impair the accuracy with which the remanent
direction is determined.
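
The factor of eight follows from the inverse-cube law for the field of a small magnet. The sketch below assumes that the sample acts as a point dipole lying on the axis through both magnets, at distance r from the lower one and r + l from the upper one; the function name is hypothetical.

```python
def torque_ratio(r, l):
    """Torque on the lower magnet divided by that on the upper one (dipole 1/d**3 law)."""
    return ((r + l) / r) ** 3

ratio = torque_ratio(1.0, 1.0)       # sample distance equal to the magnet separation
```

With r = l the upper magnet is twice as far from the sample as the lower one, so the field there, and hence the opposing torque, is smaller by 2³ = 8.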

The ‘spinning’ magnetometer avoids this difficulty but its construction is
more complex. The sample is rotated at 300 rev/min within a large electrical
coil system. The e.m.f. (of the order of microvolts) induced in this coil system by


[Block diagram labels: sample on rotating table; high-gain selective amplifier;
oscilloscope; reference coil; magslip with rotor slip-rings, stator adjustable by hand,
stator battery and rheostat; reduction gear (5 c/sec); synchronous motor (50 c/sec).]


Fig. 4. Spinning Magnetometer. The amplitude and phase of the signal fed to the 
reference coil are adjusted until the signal from the spinning sample is exactly 
annulled. The direction of the sample’s magnetism is then deduced from the 
setting of the stator that was necessary for exact annulment. (Reproduced from 
Physics and Archaeology by courtesy of Interscience Publishers Ltd.) 




the rotating magnetic moment of the sample is fed to a high-gain selective 
amplifier. The remanent direction is found by measuring the phase-angle of 
this induced alternating voltage with respect to the driving shaft. Fig. 4 shows 
one way of doing this: the amplifier output is reduced to zero by means of an 
annulling signal picked up from a reference coil. This coil is fed from a
‘magslip’ generator driven by the rotating shaft. The phase of the annulling
signal is adjusted by altering the angle of the magslip stator, and from the setting 
required the remanent direction can be deduced. 
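
The principle of reading a direction from a phase angle can be shown numerically. The sketch below is not the annulling method of fig. 4; it is a digital analogue which assumes that the induced voltage has been sampled at equal steps of shaft angle over one whole revolution, and which recovers the phase by correlating with a cosine and a sine. The function name and the test signal are hypothetical.

```python
import math

def signal_phase(samples, n_per_rev):
    """Phase in degrees of the once-per-revolution component of the sampled signal."""
    c = sum(v * math.cos(2 * math.pi * i / n_per_rev) for i, v in enumerate(samples))
    s = sum(v * math.sin(2 * math.pi * i / n_per_rev) for i, v in enumerate(samples))
    return math.degrees(math.atan2(s, c)) % 360.0

n = 360                          # one sample per degree of shaft rotation
true_phase = 40.0                # hypothetical phase lag of the induced voltage
samples = [math.cos(2 * math.pi * i / n - math.radians(true_phase)) for i in range(n)]
recovered = signal_phase(samples, n)
```

The recovered phase plays the part of the stator setting in the instrument: it fixes the direction of the component of the moment perpendicular to the axis of rotation, relative to a mark on the shaft.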


Results 


The number of structures excavated each year which are well-dated
archaeologically, and also suitable for magnetic sampling, is limited. Consequently it
will take a long time to complete the secular variation curve back from A.D. 1600
to, say, 1000 B.C., with sufficient precision and certainty for useful dating
application. However, apart from this, the results have an intrinsic geophysical
interest. Any theory of the origin of the earth’s magnetic field must also account 
for the observed secular variation—and obviously the longer the period for 
which data are available the better. It is at present believed that the main part 
of the earth’s field is generated by electrical currents in the fluid conducting core. 
Superimposed on this main part are the magnetic fields of currents in the upper 
atmosphere; these rarely exceed more than a few per cent of the main part, and 
being responsible only for short-term fluctuations, do not concern us here. 

The currents in the core are presumed to form a sort of self-exciting dynamo 
(Elsasser 1956), for which the source of energy is motion due to convection. 
The field produced by these currents approximates to that of a bar magnet at 
the centre of the earth, and the exact field that would be produced by such a 
magnet is termed the ‘ dipole field’. Over the earth’s surface there are about 
ten regions of continental extent over which the actual field deviates both in 
direction and intensity from the dipole field; this is apart from the much smaller-scale
disturbances that arise from iron-bearing rocks. The large-scale disturbances
drift westward at the rate of between 0.2 and 0.3 degrees of longitude
per year. These disturbances are thought to be due to irregularities in the 
electric-current pattern near to the surface of the core, and the westward drift 
is interpreted as a slow motion of the mantle and crust of the earth with respect 
to the core (Bullard 1958). The secular variation arises from this relative 
motion, too, and it has been shown (Runcorn 1959) that the clockwise movement 
of the representative point of fig. 1 is consistent with the crust rotating slightly 
faster than the core. The secular variation curve is not likely to repeat itself 
however, for the irregularities in the core-currents may not persist for more than 
a few hundred years, giving insufficient time for them to make a complete 
circuit with respect to the crust. The results so far obtained from thermo- 
remanent measurements (see dotted curve, fig. 1) agree with these ideas, except 
that the anti-clockwise characteristics suggest that the motion of the core has 
sometimes been eastwards with respect to the crust. 
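
The argument that the curve is unlikely to repeat itself rests on simple arithmetic: at the quoted drift rates a disturbance needs well over a thousand years to travel once round the earth. The following is a minimal sketch of that calculation; the function name is an illustrative assumption and is not taken from the article.

```python
def circuit_time_years(drift_deg_per_year: float) -> float:
    """Years needed for a disturbance drifting at the given rate
    to travel through 360 degrees of longitude."""
    if drift_deg_per_year <= 0:
        raise ValueError("drift rate must be positive")
    return 360.0 / drift_deg_per_year


if __name__ == "__main__":
    # The two drift rates quoted in the text (degrees of longitude per year).
    for rate in (0.2, 0.3):
        years = circuit_time_years(rate)
        print(f"{rate} deg/year -> about {years:.0f} years per circuit")
```

A circuit time of 1200 to 1800 years is several times longer than the few hundred years for which an irregularity in the core currents is expected to persist.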

In addition to the ancient direction it is also possible to deduce the ancient 
intensity of the magnetic field. This is done by comparing the remanent 
strength of magnetization found in a specimen, with the remanent strength 
after re-heating in the present-day earth’s field (the intensity of magnetization 
acquired is proportional to the strength of the field as long as the field is weak). 


340 M. J. Aitken


By comparing the ratio of ancient magnetization lost to new magnetization 
gained over successive temperature intervals, it is possible to check whether 
mineralogical alteration has occurred during the re-heating (Thellier and 
Thellier 1959). The results obtained by this technique indicate that in France 
the earth’s field strength has decreased by a factor of nearly two during the past 
two thousand years. Since the remanent angle of dip measured in the specimens 
corresponded approximately with the dipole field value, the most obvious 
interpretation of these results is that there has been a general decrease in the overall 
strength of the dipole field. Since cosmic-rays are deflected away from the 
earth by its magnetic field, it follows that such a large change ought to be reflected 
as a systematic error in radiocarbon dates (see previous article); the apparent 
absence of any such systematic error does not invalidate the thermo-remanent 
measurements, however, for the variations in magnetic field may be on too short 
a time-scale (compared to the 5800 year half-life of radiocarbon) to affect the 
equilibrium radiocarbon concentration seriously. 
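
Since the magnetization acquired is proportional to a weak field, the ancient intensity follows from a ratio, and the step-by-step comparison serves as a consistency check. The sketch below shows one way the arithmetic might be arranged; the function name, the use of a simple mean, the "spread" measure and all the numbers are assumptions made for illustration and are not data or procedure from Thellier and Thellier.

```python
def ancient_field_strength(lab_field, nrm_lost, trm_gained):
    """Estimate the ancient field from stepwise re-heating data.

    lab_field  : field in which the specimen is re-heated (oersted)
    nrm_lost   : ancient magnetization lost in each temperature interval
    trm_gained : new magnetization gained in the same intervals

    Returns (estimated ancient field, fractional spread of the ratios).
    A large spread would suggest alteration during re-heating.
    """
    if not nrm_lost or len(nrm_lost) != len(trm_gained):
        raise ValueError("need matching, non-empty interval lists")
    ratios = [lost / gained for lost, gained in zip(nrm_lost, trm_gained)]
    mean_ratio = sum(ratios) / len(ratios)
    spread = (max(ratios) - min(ratios)) / mean_ratio
    return lab_field * mean_ratio, spread


if __name__ == "__main__":
    # Invented readings in arbitrary units, chosen only to illustrate
    # a ratio of nearly two between ancient and present-day fields.
    field, spread = ancient_field_strength(
        lab_field=0.48,
        nrm_lost=[1.8, 3.6, 5.4],
        trm_gained=[1.0, 2.0, 3.0],
    )
    print(f"ancient field about {field:.2f} oersted, spread {spread:.1%}")
```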


Geological Measurements 

Thermo-remanent magnetic measurements are by no means confined to 
archaeological specimens. Lava from volcanic eruptions of known date is another 
source of evidence (e.g. Chevallier 1925, on Mount Etna) although the gross 
disturbance to the field direction by the volcano itself gives rise to inaccuracy. 
Of much greater importance for geophysics, however, is the information about 
conditions in past geological eras that can be obtained by a study of ‘ rock
magnetism ’. Thermo-remanent magnetism is acquired by igneous rocks as
they cool from the molten state—as might be expected from the foregoing 
paragraphs. In addition, remanent magnetism is found in sedimentary rocks; 
this results from preferential alignment of magnetic particles as they are deposited 
during formation of the rocks. Samples taken from a given geological stratum 
cover too long a period of time to give information about the secular variation; 
the average remanent direction deduced smooths this out, and the resulting 
angles of declination and dip should correspond to a magnetic dipole coincident 
with the earth’s axis of rotation. The angle of declination should therefore be 
zero (i.e. magnetic north identical with true north) and the angle of dip (I)
defined by the latitude (λ) according to the relation


$$\tan I = 2 \tan \lambda,$$


varying from zero at the equator to 90° at the poles. The actual remanent 
directions found sometimes differ considerably from these expectations. For 
instance in rocks in Britain of the Triassic period (about 170 million years ago), 
the remanent declination is 34° east and the remanent angle of dip is 25°. 
This angle of dip corresponds, according to the relation just quoted, to a latitude
of 13° north—some 40° nearer the equator than at present. The easterly 
angle of declination is interpreted as indicating that a clockwise rotation of the 
land by 34° has occurred since the rocks were formed. Other remanent results 
are in agreement with the idea that India, South Africa, South America and 
Australia were once closely grouped around the South Pole. This idea was 
originated by geologists from evidence of widespread glacial conditions in 
those countries some 200 million years ago. In some geological eras, however, 
the magnetic results suggest ‘ polar wandering ’ as the more likely hypothesis. 
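
The conversion from remanent dip to ancient latitude uses the axial-dipole relation tan I = 2 tan λ. As a minimal sketch (the function names are illustrative assumptions), the Triassic figure quoted above can be checked as follows.

```python
import math


def latitude_from_dip(dip_deg: float) -> float:
    """Latitude (degrees) at which an axial dipole gives the stated dip,
    from tan(I) = 2 tan(latitude)."""
    return math.degrees(math.atan(math.tan(math.radians(dip_deg)) / 2.0))


def dip_from_latitude(latitude_deg: float) -> float:
    """Angle of dip (degrees) expected at the stated latitude."""
    return math.degrees(math.atan(2.0 * math.tan(math.radians(latitude_deg))))


if __name__ == "__main__":
    # Remanent dip of 25 degrees found in British Triassic rocks.
    print(f"ancient latitude: {latitude_from_dip(25.0):.1f} degrees")
```

A dip of 25° gives a latitude of about 13°, in agreement with the figure in the text.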




More startling still is the inference that complete reversals of the earth’s 
magnetic field have occurred in the past, the representative dipole still lying 
approximately along the earth’s axis of rotation but in the opposite sense to the 
present one. The process of reversal takes place quite quickly geologically 
speaking (probably in less than 10 000 years) and alternative directions last for 
the order of 500 000 years. 

Natural remanent magnetism is not of interest archaeologically except for 
that acquired by varved clays. As in sedimentary rock formation, the magnetic 
particles are preferentially orientated while in suspension prior to deposition. 
Since varved clays can be accurately dated by counting the annual layers, this 
can be a valuable record of the secular variation. Unfortunately the remanent 
angle of dip appears to be influenced by other factors, but the declination is 
faithfully preserved as long as the deposition occurs from fairly calm water. 


2. MAGNETIC LOCATION 
Pottery Kilns 

Quite apart from the rôle of recording the ancient geomagnetic field, pottery
kilns occupy an important position archaeologically. Fragments of pottery, 
being durable and common, often form the main evidence about a site’s history. 
The occurrence of similar types of pottery on two different sites forms a link 
between those sites, and dating evidence obtained on one can be carried over to 
the other—pottery fragments bear some similarity to sub-standards in physics. 
A kiln is the source of these sub-standards and, since the useful life of a kiln was 
probably less than a year, discovery of different pottery types in the same kiln 
immediately establishes contemporaneity. 

Pottery kilns are preserved for us today because, in order to simplify con- 
struction and to obtain good thermal insulation, they were dug into the ground. 
Although this circumstance has preserved them, it has also made them difficult 
to find, for the old ground surface is often several feet below the present one due 
to gradual accretion of soil, and there is usually no surface indication, beyond a 
general scatter of pottery fragments when the ground is ploughed. However, 
the construction of even a small kiln demands several hundredweights of clay, 
and the thermo-remanent magnetism acquired by this on baking is sufficient to 
cause a significant increase in the magnetic field strength just above the surface 
of the ground. In practice, a couple of yards above a kiln the increase is of the 
order of 0.001 oersted, i.e. a change of 1 part in 500 in the normal value of the
earth’s magnetic field. Although this is easily detectable with the conventional 
torsion-fibre type of magnetometer, the slow speed of operation of such instru- 
ments is a very serious drawback, and the magnetic location of pottery kilns 
only became a practical matter with the availability, in portable form, of the 
recently-developed proton magnetometer. Each measurement with this 
instrument consists essentially of observing the free-precession of protons in 
the earth’s magnetic field by virtue of their nuclear spin and magnetic moment. 
Although this sounds a complex process—and indeed such free-precession was 
observed under laboratory conditions only in 1953 (Packard and Varian 1954)— 
the operation of the instrument can be made speedy and simple, and by the use 
of transistors its weight can be reduced to just over 20 lb. 




The circumstances which stimulated the development of this instrument— 
already in use for geological surveying—for archaeological work, illustrate 
forcibly the need for rapid operation. In 1958 re-routing of the Great North 
Road near Waternewton in Huntingdonshire took its path across the outskirts 
of the Roman camp of Durobrivae. From charts remaining of excavations made 
in 1828, it was expected that the two-mile stretch of new road might cut into 
pottery kilns that had been producing the rather special colour-coated (or Castor- 
ware) type of Romano-British pottery. The chance of hitting upon a few 
kilns by random trial-trenching in the 35 acres concerned was clearly infini- 
tesimal. Even coverage with magnetic survey was no light matter, for readings 
spaced only 5 feet apart were considered necessary and this involved 60 000 
measurements. With a proton magnetometer, because no levelling is involved 
as in the conventional type, readings can be made at the rate of 6 per minute. 
This made it possible to cover the area in the space of several weeks, whereas 
with the conventional instrument it would have taken nearly a year. 
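
The figures in this paragraph can be checked with a short calculation. This is a rough sketch only: it assumes a square grid and counts measuring time alone, ignoring moving between stations, setting out the grid and bad weather, and the function names are illustrative.

```python
SQUARE_FEET_PER_ACRE = 43_560


def grid_readings(area_acres: float, spacing_ft: float) -> float:
    """Number of readings on a square grid covering the stated area."""
    return area_acres * SQUARE_FEET_PER_ACRE / spacing_ft ** 2


def measuring_hours(readings: float, readings_per_minute: float) -> float:
    """Time spent purely on taking the readings, in hours."""
    return readings / readings_per_minute / 60.0


if __name__ == "__main__":
    n = grid_readings(area_acres=35, spacing_ft=5)
    hours = measuring_hours(n, readings_per_minute=6)
    print(f"{n:.0f} readings, about {hours:.0f} hours of measuring")
```

Thirty-five acres at 5-foot spacing gives close to the 60 000 readings quoted, or roughly 170 hours of continuous measurement at 6 readings per minute, which is consistent with a survey lasting several weeks.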


[Figure 5: two traverse plots of proton magnetometer reading (scale about 50150 to 50250) against distance in feet, taken with the bottle at a height of 1 foot; the kiln, the stoke-hole and the normal reading are marked.]
Fig. 5. Magnetic Anomaly due to Pottery Kiln. Two magnetic traverses through an
anomaly which, on excavating, was found to be due to a Romano-British pottery
kiln. The pedestal is in the centre of the kiln and contains an appreciable proportion
of the baked clay.
The vertical scales show the magnetic intensity changes. These have been
inverted to correspond with the sense of the proton magnetometer reading which
increases when the intensity decreases. (1 gamma = 10⁻⁵ oersted.)




Altogether seven kilns were found, six of them lying within 100 yards of each 
other, thereby demonstrating the importance of complete coverage. Figure 5 
shows detailed measurements along traverses across the top of one of the kilns 
after its presence had been inferred from an initial survey and demonstrates the 
need to keep the spacing between measurements in the initial survey down to 
5 feet at the most. For smaller features such as ovens and hearths, also detectable
on account of their thermo-remanent magnetism, a still closer spacing is
necessary.


Pits and Ditches 


In addition to burnt structures it is also possible to detect filled-in pits and 
ditches magnetically. Here the magnetic disturbance (or ‘ anomaly ’) results, 
not from remanent magnetization, but because the filling has a higher magnetic 
susceptibility than the adjacent substrata into which the pit or ditch has been 
cut. The filling may be top-soil that has silted in during the years for which the 
pit (or ditch) has been left open, or it may result from domestic rubbish, refuse, 
and excreta deliberately thrown in. Such deliberately-filled pits are particularly 
valuable for the archaeologist since they contain a wealth of material in contemporary
use. The enhanced susceptibility of pit-fillings arises from a subtle
mechanism, apparently designed by nature for the special benefit of archaeological
magnetic location. The blacker and dirtier the filling, i.e. the greater
its involvement with human occupation, the stronger is the magnetic disturbance 
created. As one passes from the sterile substrata (whether limestone, clay, 
chalk or gravel, but excluding igneous rocks) through top-soil to humus-laden 
refuse pits, the magnetic susceptibility increases from 10-* or 10-® emu/gm to 
the order of 10-*emu/gm for top-soil, reaching 10-*emu/gm for a really 
‘ ripe ’ pit.

According to Le Borgne (1955, 1960) the enhanced susceptibility of top-soil 
results from the conversion of the iron oxide in the soil from its weakly ferrimagnetic
form, haematite (α-Fe₂O₃), to the more strongly magnetic form of
maghemite (γ-Fe₂O₃). The conversion proceeds by reduction to magnetite
(Fe₃O₄) and subsequent re-oxidation to maghemite. Two mechanisms have
been suggested. The first occurs at ordinary temperatures and is favoured by
alternating periods of humidity (for reduction) and dryness (for oxidation). The
second mechanism, the more powerful of the two according to Le Borgne, is 
the cumulative effect of fire on the soil, ground clearance by burning being 
postulated as inherent in the methods of cultivation employed in antiquity. 
Although elevated temperatures have intervened, the anomaly is due to instan- 
taneous susceptibility rather than thermo-remanent magnetization: the latter 
effect is destroyed as cultivation of the ground proceeds. 

Whatever the mechanism, the practical result seems to be that pits a yard 
across, a yard deep, and buried beneath a yard of overlay produce an anomaly 
upwards of 10⁻⁴ oersted, depending on the blackness of the filling; blackness is
significant because it indicates high humus content and/or burning. This
is comfortably above the limit of sensitivity of the proton magnetometer, which
can detect differences of less than 10⁻⁵ oersted. It is also above the random
fluctuations from point to point on most terrains; these are presumably due to 
variations in the condition and depth of the top-soil. More serious are natural 
anomalies due, for instance, to solifluction channels in chalk. These are 




sometimes difficult to distinguish from ‘weak’ pits, but except where the sub- 
stratum is igneous—when the thermo-remanent magnetism of the rocks themselves 
may mask any archaeological anomalies—natural anomalies having a spatial 
extent comparable with the dimensions of archaeological features (say 3 to 10 
feet across) do not often exceed 10⁻⁴ oersted.

Besides natural variations from point to point, there is also the difficulty of 
the rather irregular diurnal variations that result from ionization currents in the 
upper atmosphere. Under severe ‘ magnetic storm’ conditions changes of 
10⁻⁴ oersted can occur within tens of seconds; magnetic surveying is only possible
then if automatic correction is made for these changes by means of an auxiliary 
magnetometer kept at a fixed station, or if some form of gradiometer is used (see 
later). Such occasions are fortunately rare, and under normal conditions, as 
long as an area (say 50 ft x 50 ft) large compared to the dimensions of archaeo- 
logical anomalies is covered fairly quickly (say within 15 min) the effect of the 
diurnal variation is unimportant. Disturbance similar to magnetic storms is 
caused by electric trains and trams run from D.C. 

Two Applications 

The detection of pits is particularly valuable on Iron Age hill-forts. Although 
the ramparts bear impressive witness (as for example at Maiden Castle) 
to the military activity of the times, the evidence about domestic details is less 
easy to come by. The area inside the ramparts of these forts may amount to
ten or twenty acres and its surface is distressingly blank to the archaeologist. 
Hill-forts, because of their situation, are not often under crop and consequently 
aerial photography is not usually of much help. 


[Figure 6: Magnetic chart of Danes Camp, with a 50 ft scale bar. Symbols mark violent increase, mild increase and mild decrease; the two entrances and the annexe are labelled.]


Fig. 6. Magnetic Chart of an Iron-Age Hill-fort. The magnetic survey shows clearly
that only the main part of the fort was intensively occupied; the annexe was probably
used for cattle. Excavation on a dozen strong anomalies revealed pits in most
cases. (Violent increase: 3 × 10⁻⁴ oersted or more. Mild increase: 10⁻⁴ to 3 × 10⁻⁴
oersted.)




Figure 6 shows the magnetic anomalies on one of the first hill-forts surveyed 
with the proton magnetometer. About a dozen of these anomalies were exca- 
vated and, with two exceptions, pits were found. The two exceptions were a 
well and an iron poker (or cauldron support?). The amount of digging to find 
the same amount of archaeological evidence by random trial-trenching would 
require resources that are quite out of the question for archaeologists today. 

Ditches do not show up as strongly as pits because their filling tends to be 
more sterile, consisting of natural silting and rampart fall-in rather than refuse. 
Although the resulting anomaly is rather small, perhaps 5 × 10⁻⁵ oersted or
less, the fact that a ditch is a continuous feature makes it more easily recognizable 
magnetically (as also in an aerial photograph). Recent magnetic surveys at 
Verulamium (St. Albans) illustrate this. Here, a substantial defensive ditch 
was followed through the magnetic confusion created by isolated features, both 
archaeological and modern, for over a mile. Excavation at Verulamium in the 
nineteen-thirties had suggested that the early defences of the first century A.D. 
had lain outside the later town-wall. In 1955 excavations in connection with 
road improvements on Bluehouse Hill revealed an early ditch within the town- 
wall. The course of this could not be followed by trial-trenching because 
apart from anything else it appeared to run under the Corporation sports 
ground. In 1959, tests with the proton magnetometer adjacent to the discovered 
section showed it to be just detectable, the anomaly being somewhat less than 
10⁻⁴ oersted. Subsequently it was followed across the rugger pitch and through
to the first-eleven cricket pitch. In the middle of the sacred 22 yards it turned 
through a right-angle and was then followed down to the artificial lake, with a 
break that corresponded to the alignment of Watling Street. The course
traced with the proton magnetometer—and subsequently confirmed at several 
points by excavation (not on the cricket pitch)—has now been interpreted as the 
outline of the first-century city. It lies entirely within the later town-wall. 


The Proton Magnetometer 
The phenomenon of proton free-precession has already been described in 


[Figure 7 diagram. Panel (a): a proton, labelled with the earth’s magnetic field F, the spin and the gyration path. Panel (b): a top, labelled with the earth’s gravitational field, the spin, the gyration path and the gravitational torque on the top.]

Fig. 7. Gyration of: (a) a proton in a vertical magnetic field; (b) a spinning top. (Reproduced
from Physics and Archaeology by courtesy of Interscience Publishers Ltd.)




this journal (Feb. 1962), but an elementary recapitulation may be acceptable to 
some readers. Because a proton has a magnetic moment (1.4 × 10⁻²³ emu),
there is a magnetic torque on it tending to align it along the direction of the lines
of force of the earth’s magnetic field. The gyroscopic properties associated
with its nuclear spin momentum prevent such alignment happening immediately, 
and the proton gyrates while in the gradual process of achieving this direction. 
These gyrations are similar to those of a top towards the end of its spin (fig. 7). 
In the case of a proton the frequency of precession is exactly proportional to the 
magnetic field strength. This frequency is about 2000 cycles per second for a 
field strength of 0.48 oersted (average figure for Britain).
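
The proportionality between precession frequency and field can be put in numerical form. The sketch below assumes a present-day value for the proton constant, about 4257.7 cycles per second per oersted, which is not a figure given in the article; the names are illustrative.

```python
# Assumed modern value of the proton gyromagnetic ratio divided by 2*pi,
# expressed in cycles per second per oersted (not taken from the article).
PROTON_HZ_PER_OERSTED = 4257.7


def precession_frequency_hz(field_oersted: float) -> float:
    """Free-precession frequency of protons in the stated field."""
    return PROTON_HZ_PER_OERSTED * field_oersted


if __name__ == "__main__":
    f = precession_frequency_hz(0.48)  # average field for Britain
    print(f"precession frequency about {f:.0f} c/s")
```

With this constant a field of 0.48 oersted gives a little over 2000 cycles per second, as stated in the text.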

In order to detect this phenomenon it is necessary to have a large number of 
protons gyrating in phase. Apart from liquid hydrogen, ordinary water contains 
nearly as high a number of protons per cc as any other liquid. A convenient 
experimental arrangement is a quarter-pint bottle of distilled water around 
which a 1000-turn coil of copper wire is wound. By passing a current of about 
one amp through this coil, a ‘ polarizing’ field of several hundred oersted is 
produced along the axis of the bottle; this is orientated perpendicularly to the 
lines of force of the earth’s field (i.e. axis horizontal and east-west). This


[Figure 8: block diagram showing the timer, the frequency divider, the crystal oscillator, the decade units (10000, 1000, 100, 10, 1) and the monitor.]

Fig. 8. The Proton Magnetometer: Block diagram. Each decade meter indicates a
digit, and the five-figure number so formed is inversely proportional to the magnetic
intensity. (Reproduced from Physics and Archaeology by courtesy of Interscience
Publishers Ltd.)


polarizing field tends to align the protons though, because of thermal agitation, 
the effective fraction aligned is only 1 in ten million. Two or three seconds are 
necessary to achieve this, after which the polarizing field is cut off sharply. The 
protons now gyrate and an alternating e.m.f. of about a microvolt is induced in 
the coil. This is switched to a low-noise highly-selective amplifier, the output
of which is fed to frequency-measuring circuits. 




Due to external and internal field inhomogeneities, the gyrating protons 
gradually lose phase coherence, and the signal decays away within a few seconds. 
However it is possible to measure the frequency of the proton signal to a precision 
of 1 part in 50000 within half-a-second by the following technique. After 
amplification (figs. 8 and 9) the signal is squared and fed to a ten-stage binary 


Fig. 9. The Proton Magnetometer: Waveforms. (a) Proton signal from amplifier,
f ≈ 2000 c/s. (b) After squaring. (c) After frequency division by two, f ≈ 1000 c/s.
(d) After frequency division by ten binary stages, f ≈ 2 c/s. (e) Gated output of
100 kc/s oscillator. (Reproduced from Physics and Archaeology by courtesy of
Interscience Publishers Ltd.)


chain. This divides the frequency by a factor of 1024 (2¹⁰), so that the resulting
square wave has a frequency of approximately 2 cycles/sec. This is fed to a 
gate in such a way that the gate is open for exactly the first 1024 proton gyrations, 
after which it remains closed. While the gate is open, the output of a 100 kc/s 
crystal-controlled oscillator is allowed through to a decade counting chain. 
This records the number of 100 kc/s oscillator pulses that take place during 
exactly 1024 proton gyrations. This five-figure number, each digit of which 
is indicated on a separate meter, is inversely proportional to the magnetic field 
strength in which the protons are gyrating. 
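
The counting scheme just described can be modelled in a few lines. This is a sketch under stated assumptions: the proton constant of about 4257.7 cycles per second per oersted is a present-day value not given in the article, the function names are illustrative, and the real counter of course registers whole pulses only.

```python
PROTON_HZ_PER_OERSTED = 4257.7  # assumed modern value, not from the article
OSCILLATOR_HZ = 100_000         # 100 kc/s crystal-controlled oscillator
GATED_GYRATIONS = 1024          # gate stays open for 1024 proton gyrations


def counter_reading(field_oersted: float) -> float:
    """Oscillator pulses passed while the gate is open (idealized)."""
    precession_hz = PROTON_HZ_PER_OERSTED * field_oersted
    gate_seconds = GATED_GYRATIONS / precession_hz
    return OSCILLATOR_HZ * gate_seconds


def field_from_reading(reading: float) -> float:
    """Invert the reading to recover the field strength in oersted."""
    return GATED_GYRATIONS * OSCILLATOR_HZ / (PROTON_HZ_PER_OERSTED * reading)


if __name__ == "__main__":
    reading = counter_reading(0.48)
    one_count = field_from_reading(reading) - field_from_reading(reading + 1)
    print(f"reading about {reading:.0f}; one count is about {one_count:.1e} oersted")
```

On these assumptions the gate is open for about half a second, the reading is a five-figure number near 50 000, and one count corresponds to roughly 10⁻⁵ oersted, i.e. about one gamma. The reading falls as the field rises, which is the inverse proportionality noted in the text.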

The sequence of polarizing current and measurement is carried out automati- 
cally, so that the only action required of the operator is to read off the five 
digits. The detector-bottle, on a suitable mounting, is carried by a second 
operator over the area being surveyed. By using transistors throughout, the
power requirements can be met by a miniature 12-volt accumulator. 




The ‘ Bleeper’ 

In the proton magnetometer one obtains an absolute value for the magnetic
field strength by comparing the proton precession frequency with that of a
crystal-controlled oscillator. When locating archaeological remains, one is
looking for significant differences in the field strength within a distance comparable
with the dimensions of the feature concerned. One can therefore
dispense with the crystal-oscillator by comparing the proton precession
frequencies from two detector-bottles carried at either end of a rod six or eae 
feet long. The simplest method of comparison is to add the two proton signals 
together and observe the beat frequency.

The two bottles are fed by a common polarizing current, so that at its cessation
the initial proton signals are in phase. If the two bottles are in identical


Fig. 10. The Bleeper. The two detector-bottles are carried at either end of the staff on 
the right. The combined proton signals are amplified, and fed to the hearing-aid 
earphone. In a magnetic disturbance the two proton signals differ in frequency 
and ‘ beats ’ are heard in the earphone. 




field strengths then the sum of the two signals has twice the individual ampli- 
tudes, and decays away in about three seconds in the usual way. If the field 
strengths are sufficiently different then ‘ beats’ will occur. However, for small 
differences of field, say 10⁻⁴ oersted, the beat frequency is only 0.4 cycles/sec.
Nevertheless such small differences can be detected because, again for a difference
of 10⁻⁴ oersted, the combined signal reaches zero after 1.25 sec, and this is
clearly distinguishable from the case of equal fields. The time to reach zero 
can be measured electrically and an accurate measurement of the field difference 
thereby obtained. 
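
The beat figures follow from adding two signals of equal amplitude that start in phase: the envelope of the sum first reaches zero after half a beat period. The sketch below assumes a present-day proton constant of about 4257.7 cycles per second per oersted (not given in the article) and neglects the decay of the signals; the names are illustrative.

```python
PROTON_HZ_PER_OERSTED = 4257.7  # assumed modern value, not from the article


def beat_frequency_hz(field_difference_oersted: float) -> float:
    """Difference between the two precession frequencies."""
    return PROTON_HZ_PER_OERSTED * abs(field_difference_oersted)


def first_zero_seconds(field_difference_oersted: float) -> float:
    """Time at which the summed signal first falls to zero: two equal
    signals starting in phase cancel after half a beat period."""
    return 1.0 / (2.0 * beat_frequency_hz(field_difference_oersted))


def field_difference_from_zero(time_seconds: float) -> float:
    """Field difference (oersted) deduced from a timed first zero."""
    return 1.0 / (2.0 * PROTON_HZ_PER_OERSTED * time_seconds)


if __name__ == "__main__":
    diff = 1e-4  # oersted
    print(f"beat frequency about {beat_frequency_hz(diff):.2f} c/s")
    print(f"first zero after about {first_zero_seconds(diff):.2f} sec")
```

The results, roughly 0.43 cycles per second and 1.2 seconds, agree with the rounded values of 0.4 cycles/sec and 1.25 sec quoted in the text.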

For archaeological work such accurate measurement is hardly necessary: 
it is enough to detect the presence of a magnetic disturbance and make a rough 
quantitative estimate of its strength. The ‘ Bleeper ’ (Aitken 1961) consists only 
of the necessary amplifying circuits to feed the combined signal to an earphone 
or loudspeaker. The duration of the signal is timed with a stop-watch. In this 
way the complexity, weight, and cost can all be reduced by an order of magnitude. 
Figure 10 shows such an instrument: it is a remarkably simple way of detecting 
a nuclear physics phenomenon which ten years ago had not yet been observed. 


The Differential Flux-gate Magnetometer 


The flux-gate magnetometer has been used in geological surveying for many 
years, particularly in airborne work. One difficulty with this type of instrument 
is that servo-motors must be used to maintain a precise orientation of the detector 
element with respect to the magnetic field. However, by using two detectors 
in a similar way to that just outlined for the ‘ bleeper ’, approximate alignment is 
sufficient for measurement of gradient as long as both detectors are firmly fixed 
to either end of a rigid rod. Such an instrument has been designed by C. Locke
and used archaeologically by Z. Wiatr in Jordan, in connection with the search 
for Dead Sea scrolls. As with the bleeper, large-scale disturbances, such as 
the diurnal variation or from D.C. trains, affect both detectors equally and are 
therefore ignored. 


3. OTHER METHODS OF ARCHAEOLOGICAL PROSPECTING 


Resistivity Surveying 

The electrical resistivity of soil varies between 10 and 10? ohm-cm depending 
on the water content, whereas for rocks the value is upwards of 10⁴ ohm-cm.
Consequently by measuring the effective resistivity between two probes stuck 
into the ground it is possible to detect the presence of such features as soil- 
filled ditches cut into rock, foundation-walls and roads intruding into soil, 
tombs and caverns. In practice it is necessary to use four probes in order to 
avoid the effects of variable contact resistance. A typical arrangement is shown 
in fig. 11. Current passes between the two outer probes, and the voltage thereby 
developed across the two inner probes is balanced against a voltage proportional
to that current. The greater the effective resistivity of the ground, the greater 
is the ratio of inner probe voltage to outer probe current, and the higher must be 
the setting of the resistor, S, in order to obtain zero deflection in the detector. 
The depth of ground surveyed is roughly equal to the interprobe separation. 
There are various forms of the instrument. The smallest, employing transistors, 
can be held in the palm of the hand, and is powered by a 4.5 volt torch battery.
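
The text gives only the proportionality between the voltage-to-current ratio and the resistivity. For four equally spaced probes on uniform ground the usual constant of proportionality is 2π times the probe spacing (the arrangement commonly known as the Wenner array); that formula is supplied here as an assumption and is not quoted from the article. The readings in the example are invented.

```python
import math


def apparent_resistivity_ohm_cm(spacing_cm: float, volts: float, amps: float) -> float:
    """Apparent resistivity for four equally spaced probes on uniform
    ground, using the standard relation rho = 2 * pi * a * V / I
    (an assumption added here, not stated in the article)."""
    if spacing_cm <= 0 or amps == 0:
        raise ValueError("spacing must be positive and current non-zero")
    return 2.0 * math.pi * spacing_cm * volts / amps


if __name__ == "__main__":
    # Invented reading: probes 100 cm apart, 0.05 V across the inner pair
    # for 0.01 A through the outer pair.
    rho = apparent_resistivity_ohm_cm(spacing_cm=100.0, volts=0.05, amps=0.01)
    print(f"apparent resistivity about {rho:.0f} ohm-cm")
```

Because the depth surveyed is roughly equal to the probe separation, repeating the measurement with a wider spacing samples deeper ground.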




An interesting application of the resistivity technique, using a geophysical 
form of the familiar ‘ megger ’, is the search for the royal treasure lost by King 
John in 1216 when an exceptionally high tide swept his baggage train from the 
causeway on which it was crossing the Wash estuary. The area concerned is
now dry land; no visible trace of the causeway remains, but evidence for its 
position has been obtained by resistivity measurements. It is supposed that 
the causeway followed a natural bank formed by a rise in the clay bed that is 


[Figure 11: circuit diagram with the labels ‘A.C. source’, ‘Calibrated’ and ‘Null detector’.]


Fig. 11. Earth Resistivity Meter. The four probes are inserted in the ground at equal 
spacing, and the ratio of the voltage across the inner pair to the current injected 
through the outer pair is proportional to the resistivity of the ground. (Reproduced 
from Physics and Archaeology by courtesy of Interscience Publishers Ltd.) 


elsewhere covered with 40 ft of sand; since the resistivity of the sand is a little 
higher than that of the clay, it has been possible to detect the bank. The 
position of the treasure along the course of the bank remains unknown and it is 
unlikely that the relative size of such metal parts as still remain will be sufficient 
to produce an abnormal resistance reading. 


Electromagnetic Methods 


A readily available instrument for finding buried metal is the war-time mine 
detector. The usual principle on which this works is the slight alteration in 
the mutual inductance between two coils that is caused by the proximity of a 
piece of metal (due to eddy currents). Unfortunately the sensitivity falls off as 
the sixth power of the depth, and only large objects can be detected further 
away than a couple of feet. 
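
The severity of a sixth-power law is easily illustrated. This minimal sketch takes the stated power law at face value and compares the response at two depths; the function name is an illustrative assumption.

```python
def relative_sensitivity(depth: float, reference_depth: float) -> float:
    """Response at `depth` relative to that at `reference_depth`,
    for a sensitivity falling off as the sixth power of the depth."""
    if depth <= 0 or reference_depth <= 0:
        raise ValueError("depths must be positive")
    return (reference_depth / depth) ** 6


if __name__ == "__main__":
    # Doubling the depth from 1 ft to 2 ft.
    print(f"response falls to 1/{1 / relative_sensitivity(2.0, 1.0):.0f}")
```

Doubling the depth reduces the response by a factor of 64, which is why only large objects remain detectable beyond a couple of feet.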

In N. America metal detectors are advertised as an essential part of any 
treasure hunter’s equipment. In Britain the same equipment is offered more 
prosaically to civil engineers for pipe detection. These devices usually consist 
of a transmitter coil set at right angles to a receiver coil a couple of yards away, 
both mounted rigidly on the same boom. The receiver coil is carefully orien- 
tated for zero pick-up over metal-free ground. In the proximity of buried 
metal eddy currents upset the null condition, and a signal is received. A 
carrier frequency of around 100 kc/s is employed, suitably modulated for audio- 
detection. The makers advertise that a detector of this type can detect a chest of 
bullion at a depth of 10 ft. Unfortunately such finds are rare in archaeology. 




Seismic Detection 


In geology, seismic exploration is of great importance. Explosive charges 
are let off and the times for reflected waves to return to the surface indicate
the depths of various strata-changes. This is a cumbersome method to apply 
on the small scale required for archaeology, and more refined techniques are now 
being investigated by the University Museum at Philadelphia and by the Lerici 
Foundation in Italy. On the other hand, the old-fashioned technique of bosing, 
in which the ground is thumped with a heavy rammer, can be a very powerful 
one on the right terrain, and in experienced hands. Detection of tombs at 
Jericho is apparently done more effectively by a local inhabitant with a pickaxe 
handle than is ever likely with the techniques of modern science ! 


4. CONCLUSION 


It does indeed remain true that in the interpretation of evidence concerning 
human activity, the human faculties of a trained archaeologist represent a type 
of instrumentation that is difficult to better. Nevertheless there are a number 
of restricted fields in which the application of physics has proved a very powerful 
tool: radiocarbon dating, magnetic dating, magnetic location for instance. 
The value of these as scientific aids to archaeology is acknowledged, sometimes 
too readily. Less well appreciated is the aid which archaeology can give, and 
has given, in return. Authenticated wood from Egyptian tombs has helped to 
establish the constancy of the cosmic-ray intensity. Ancient baked clay, related 
by archaeological excavation to written records, enables the past behaviour of 
the earth’s magnetic field to be established. On a more utilitarian plane, 
archaeological requirements have stimulated technical developments in the 
application of proton free-precession, and of analytical techniques, particularly 
x-ray fluorescent spectrometry. 

On the face of it there seems to be plenty of intercourse across the gap that is 
supposed to divide Arts and Science. A realization of the potentialities of 
such intercourse led the late Lord Cherwell to initiate the establishment, in 1955 
at Oxford University, of the Research Laboratory for Archaeology and the History 
of Art. This institution is basically an applied physics laboratory, which seeks 
the collaboration and advice of outside archaeologists in the prosecution of its 
work. A significant proportion of the work described in these two articles has 
taken place in that Laboratory and the enthusiasm with which it has been 
received leaves no doubt of the value of Cherwell’s foresight. 


REFERENCES 


AITKEN, M. J., 1961, Physics and Archaeology, 52-59. (Interscience Publishers, New York 
and London.) 

BLACKETT, P. M. S., 1952, A negative experiment relating to magnetism and the earth’s
rotation. Phil. Trans., 250A, 309-370.

BULLARD, E. C., 1958, The secular variation of the Earth’s Magnetic Field. Year Book of
the Physical Society, 47-60.

CHEVALLIER, R., 1925, L’aimantation des laves de l’Etna et l’orientation du champ terrestre
en Sicile du XII au XVIII siècles. Ann. Phys., Paris, 4, 5-162.

ELSASSER, W., 1956, Hydromagnetic Dynamo Theory. Rev. Mod. Phys., 28, 135.

FOLGHERAITER, G., 1899, Sur les variations séculaires de l’inclinaison magnétique dans
l’antiquité. Arch. Sci. phys. nat., 8, 5-16.




LE BORGNE, E., 1955, Susceptibilité magnétique anormale du sol superficiel. Ann.
Géophys., 11, 399-419.

LE BORGNE, E., 1960, Influence du feu sur les propriétés magnétiques du sol. Ann.
Géophys., 16, 159-195.

NÉEL, L., 1955, Some theoretical aspects of rock-magnetism. Advances in Physics, 4,
191-243.

PACKARD, M., and VARIAN, R., 1954, Free nuclear induction in the earth’s magnetic field.
Phys. Rev., 93, 941.

RUNCORN, S. K., 1959, On the theory of the Geomagnetic Secular Variation. Ann.
Géophys., 15, 87-92.

THELLIER, E., 1938, Sur l’aimantation des terres cuites et ses applications géophysiques.
Ann. Inst. Phys. Globe, 16, 157-302.

THELLIER, E., 1951, Propriétés magnétiques des terres cuites et des roches. J. Phys., 12,
205-218.

THELLIER, E., and THELLIER, O., 1959, Sur l’intensité du champ magnétique terrestre dans
le passé historique et géologique. Ann. Géophys., 15, 285-376.


Electromagnetism for Schools, based on M.K.S.A. Units 


by E. J. HAY
High School, Waimate, New Zealand 


Part I. The Principles 


1. EDITOR’S INTRODUCTION


There is still considerable opposition to the teaching of electricity in m.k.s.a. 
units in Britain. In many influential quarters this is regarded as an ephemeral 
craze which will run its course and die out; and it is difficult for mature teachers to 
acclimatize themselves quickly to change. This article from a teacher who 
has made the change happily, and from a part of the Commonwealth which 
has accepted it unanimously at University level and recommends it for schools, 
shows that the m.k.s.a. approach is both practicable and flourishing. Mr. Hay’s 
article is to some extent in note form; it has been arranged, but not tampered with,
editorially.


2. A BRIEF HISTORY OF THE M.K.S. SYSTEM OF UNITS


The m.k.s. system of units, proposed by Giorgi in 1901, was adopted by the 
International Electrotechnical Commission (I.E.C.) of Electrical Engineers in 
1935, and recommended for universal adoption by the International Union of 
Pure and Applied Physics, Commission on Symbols, Units and Nomenclature 
(S.U.N. Commission) in 1948. In 1960 the I.E.C. adopted the ampere as the 
fundamental electrical unit, and the fourth fundamental unit in Physics; thus 
the m.k.s.a. system of units. 

In practice, two m.k.s. systems developed, differing from each other in 
fundamental formulas because they adopted different definitions of magnetic 
moment. These are the Kennelly convention which requires the use of both
B and H to represent magnetic fields with elementary classes, and the
Sommerfeld convention which bases the study of all magnetic fields in free
space on the flux density of the field B. The Kennelly system was advocated
as early as 1925 in articles in American and British electrical journals, but little 
was published in English on the Sommerfeld system until the appearance of 
‘Electromagnetic Theory’ by Stratton, in 1941. 

In 1938 the Committee on Electrical and Magnetic Units of the American 
Association of Physics Teachers (A.A.P.T.) issued a Report (American Physics 
Teacher, Vol. VI, p. 144) in favour of the use of m.k.s. units in electricity and 
magnetism. 

Twelve years later, the A.A.P.T. sponsored the Coulomb’s Law Committee 
Report, entitled ‘The Teaching of Electricity and Magnetism at the College 
Level’, probably the most influential report to appear on this subject since the 
war. This was published in the American Journal of Physics, Vol. 18, 1950, 
pp. 1-25, 69-88. The report examines critical issues in the logical development 
of electromagnetic theory, and outlines the traditional point pole approach,
and also the modern approach in which all magnetic fields are referred to electric
currents. The Kennelly and Sommerfeld conventions are compared and reasons 
are given (pp. 22-23) why the Sommerfeld convention is preferable for use with
the Amperian currents approach in ferromagnetism. 

The British Science Masters’ Association published a report in 1954, entitled
‘The Teaching of Electricity, with special reference to the use of M.K.S.
Units’, which recommended the adoption in schools of the rationalized m.k.s.
units. One may, however, prefer other starting points than those used in this
report, and another order, e.g. those of the Coulomb’s Law Committee Report.
Unfortunately, this S.M.A. report accepted the Kennelly convention used in the
1938 American report.

In New Zealand, the four university physics departments teaching electricity 
and magnetism have adopted the rationalized m.k.s.a. units, and the modern 
Amperian currents approach to ferromagnetism. The Auckland University 
has printed a 40-page circular entitled, ‘ M.K.S. Units for Schools’ by M. A. 
Thompson, to assist schools in introducing the new approach. This recom- 
mends the use of m.k.s. units in mechanics as well as electricity; regards this 
as inevitable in the long run; and exhorts the schools in the Auckland district 
to smooth out the difficulties of changing as soon as possible. It makes the 
point that students find it easier to change than their teachers do! Although 
excellent text-books are available at university level, teaching at matriculation 
or university entrance level in schools is handicapped by the absence of a suitable 
textbook, but steps are being taken to remedy this defect. Adoption of a 
revised entrance and scholarship syllabus is delayed for the same reason. 
Scholarship forms find ‘University Physics’, Vol. 2, by Sears and Zemansky,
a standard reference listed by three of our universities, most helpful.

In 1958, the State of Victoria, Australia, introduced the rationalized m.k.s.a.
system of electrical units in the teaching of physics at fifth form level. (School 
Science Review, 139, 426.) 


3. TRADITION IN ELECTRICITY


The historical development of electromagnetism (i.e. electrostatics, electro- 
dynamics and electromagnetic induction) was much like putting together the 
different sections of a jig-saw puzzle, and then finally fitting these sections 
together to form the complete picture. Each section of the Gaussian and 
Kennelly theory is a homogeneous unit, but the sections do not fit together well,
to give a clear unified picture of the subject. Today, however, we can start 
with a look at the picture as a whole and then make a detailed study of its parts 
in some logical order. 

In Britain, those who have adopted m.k.s. units have generally done so, as 
far as possible, within the framework of traditional theory in the form of the 
Kennelly m.k.s. system, which uses the traditional approach to magnetism by way 
of magnetic poles, and emphasizes the H field in magnetism and the artificial 
magnetostatic theory which divorces ferro-magnetism from electromagnetism. 
It represents a reluctance to abandon unit magnetic poles and magnetostatic 
theory in favour of the direct interaction between electric currents and the 
acceptance of all magnetism as an inherent property of the moving charge. 

It is felt that a modified form of the Sommerfeld m.k.s. system employing
Amperian currents, which is widely used in American universities and has been
adopted by all four New Zealand universities, is simpler than either the Gaussian 
c.g.s. system or the Kennelly m.k.s. system, and more suitable for use in schools. 


4. SOME ESSENTIALS OF THE MODIFIED SOMMERFELD M.K.S. SYSTEM


4.1. The Parallel Currents Absolute Ampere 


The modern approach develops all electrical theory from the ampere, as 
defined from the force between parallel currents. Electrostatics is derived 
from the coulomb, the unit of charge derived from the ampere. The unit of 
electric field intensity is defined by the force on a static charge, while magnetic 
flux density is defined by the force on a moving charge or current in a conductor. 


4.2. Adoption of One System of Units 


Instead of using electrostatic, electromagnetic and practical units, it is 
simpler to use the practical units alone in the form of the internationally accepted 
rationalized absolute m.k.s.a. electrodynamic units, which are adequate for all 
theoretical and practical work. 

It is easier to think in one language rather than in three languages which 
require translation one into another! 


4.3. Rationalized Formulae 


Magnetic fields may be represented pictorially by flux lines and electric 
fields by lines of force. With suitably defined unit lines, the density of the 
flux lines may be a measure of the magnitude of the magnetic induction B. 
In the same way the density of the electric lines of force may be a measure of 
the electric intensity E. 

The density of the electric lines of force involves the area of the surface at 
which the density is calculated. Around a point charge this is the area of a 
sphere, $4\pi r^2$, while around a charged wire this is the area of a cylinder, $2\pi r l$.
For a uniform electric field the area of a plane surface is involved, where $A = l \cdot b$
for a rectangle, or $A = \pi r^2$ for a circle.

This is made clear by the use of Gauss’ law for lines of force with electric 
charges in vacuo. It is not necessary to treat $4\pi$ as a means of juggling some
equations to simplify others derived from these. 

In a similar way, with the cylindrical magnetic flux around a straight wire, 
although the flux does not pass through a cylindrical surface around the wire, 
but is everywhere parallel to it, the flux density B at a distance r from the wire 
is inversely proportional to the surface area, $2\pi r$, of the cylinder of radius r
and unit length.

It is preferred, however, to consider the flux density at a distance r from the wire
to be inversely proportional to the length of the circular flux line, length $2\pi r$,
around the wire. This latter point of view is incorporated in Ampere’s circuital
law. In either case the factor $2\pi$ arises naturally with relation to circular
symmetry. 
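
The geometrical origin of the factors $4\pi$ and $2\pi$ can be shown numerically by writing a single field formula, charge divided by $\epsilon_0$ times the relevant area, and changing only the area. The following is a minimal sketch; the function names, the illustrative charge and the dimensions are assumptions made for the example, and the value of $\epsilon_0$ is the standard one.

```python
import math

EPS0 = 8.854e-12  # permittivity of free space in farad/metre (standard value)


def field_from_area(charge, area):
    """E = (1/eps0) * Q / A, with A the area crossed by the lines of force."""
    return charge / (EPS0 * area)


def sphere_area(r):
    """Area of a sphere of radius r: the source of the factor 4*pi."""
    return 4 * math.pi * r**2


def cylinder_area(r, length):
    """Curved area of a cylinder of radius r: the source of the factor 2*pi."""
    return 2 * math.pi * r * length


q = 1e-9  # coulomb, illustrative value only
print(field_from_area(q, sphere_area(0.1)))         # point charge, r = 0.1 m
print(field_from_area(q, cylinder_area(0.1, 1.0)))  # charged wire, 1 m long
print(field_from_area(q, 0.01))                     # uniform field, A = 0.01 m^2
```

The same one-line formula serves all three cases, which is the practical meaning of rationalization for the pupil.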




4.4. Unified Treatment of Electromagnetism 


In the past, electrostatics and magnetostatics, current electricity, electro- 
magnetism and electromagnetic induction have been studied largely as separate 
subjects, sections of the Gaussian and Kennelly jig-saw puzzles which did not 
fit together readily to form a unified whole. The Gaussian e.s.u.’s and e.m.u.’s 
require a variety of conversion factors. Kennelly bases electromagnetism on 
the B field, and ferromagnetism on the H field. The independent c.g.s. starting 
points, the unit charge and the unit pole, were analogous but bore no direct
relation to each other. 

Today, electricity is regarded as the study of charges at rest and in motion. 
As a static charge is surrounded by an electric field and a moving charge is 
surrounded by a magnetic field, in addition to its electric field, electricity may 
be defined as the study of electric and magnetic fields. 

The difference in behaviour of electric charges at rest and in motion, the
static and dynamic behaviour of electricity, is fundamental. Yet electrostatics
and electrodynamics should not be treated as independent subjects but as two 
aspects of the properties of electric charges. Rowland’s experiment is instructive 
here. 

Steady electric and magnetic fields, as of a moving charge in an oscilloscope 
tube, bear a relation to each other depending on the velocity of the moving charge. 
A second moving charge experiences forces due to both fields. 

Varying or moving magnetic and electric fields give rise to the two forms of 
electromagnetic induction, Faraday’s induced electric field produced by a 
changing magnetic field, and Maxwell’s induced magnetic field produced by a 
changing electric field. The latter effect led to Maxwell’s prediction of electro-
magnetic waves. 

Electric and magnetic fields are not separate and unrelated phenomena but 
are intimately connected. Only when the fundamental inter-relations between 
electric and magnetic fields are understood do we realize the essential unity in 
electricity. It is most desirable that, even at an elementary stage in schools,
our treatment should bear out this unity. 

It is simpler and clearer to treat electricity as a unified subject dealing with 
the phenomena connected with electrons at rest and in motion, and with their 
accompanying electric and magnetic fields, rather than as the largely unrelated 
electrostatics, magnetostatics, electromagnetism and electromagnetic induction. 

The conservative Kennelly system of m.k.s. units perpetuates this disunity by
retaining, as far as possible, the old c.g.s. approaches to the different
branches of the subject. If there is a tendency to teach less electrostatics in 
schools, it will be because the fundamental nature of electricity is not understood. 


5. ELECTRIC AND MAGNETIC FIELDS IN FREE SPACE


The Lorentz Force Law which gives the total force acting on a moving 
charge in electric and magnetic fields may be expressed (in the teacher’s mind!) 
as $\mathbf{F} = Q\mathbf{E} + Q\mathbf{v} \times \mathbf{B}$ newton. It is simpler to base all study of electric and
magnetic phenomena in free space on the E and B fields, on electric intensity 
E and magnetic induction B. 
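
For the teacher who wishes to see the Lorentz force law at work numerically, a short sketch follows. It evaluates $\mathbf{F} = Q\mathbf{E} + Q\mathbf{v} \times \mathbf{B}$ component by component; the function names and the numerical values are illustrative assumptions, chosen so that the electric and magnetic forces cancel.

```python
def cross(a, b):
    """Vector product of two 3-component tuples."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def lorentz_force(q, e_field, velocity, b_field):
    """Force in newtons on charge q (coulomb): F = qE + q v x B."""
    v_cross_b = cross(velocity, b_field)
    return tuple(q * (e + vb) for e, vb in zip(e_field, v_cross_b))


# Illustrative values: E along y, B along z, charge moving along x.
q = 1.6e-19               # coulomb
e_field = (0.0, 1e4, 0.0)  # newton/coulomb
b_field = (0.0, 0.0, 0.01)  # weber/metre^2
velocity = (1e6, 0.0, 0.0)  # metre/second, equal to E/B here

print(lorentz_force(q, e_field, velocity, b_field))
```

With the speed equal to E/B the two terms balance and the charge passes undeflected, a simple case in which both fields must be considered together.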


Electric lines of force and magnetic lines of flux should be made just as
‘concrete’ as are rays of light in optics; the grass-seed and iron filings
experiments demonstrating field lines in electric and magnetic fields are highly desirable
in this respect. 

All reference to D and H fields, and to relative permittivity $\epsilon_r$, and relative
permeability $\mu_r$, may be omitted while we are dealing only with fields in free
space or in air. 

The E—D and B—H dualisms are confusing to beginners, and make the 
subject unnecessarily difficult so that any system of teaching electricity which 
requires their introduction at an early stage, e.g. the Kennelly m.k.s. system, 
is not suitable for use in schools. Dr. H. F. Boulind of Cambridge, has stated 
in a private communication, ‘ If I were writing a schools’ textbook on electricity, 
I would be tempted to omit all reference to H’. One New Zealand university 
omits the H field with first year classes. 

And a suggested University Entrance Scholarship prescription says: 
‘ Questions in electricity will be formulated using rationalized m.k.s.a. units in 
electricity. Answers to questions on electricity will not necessitate the use of 
magnetic poles, of the electric flux density D or of the magnetic vector H; 
candidates who wish may make use of these quantities.’ 


5.1. The Sommerfeld Convention in Magnetism 


Let us examine the two conventions, previously mentioned, in greater detail.

The Sommerfeld convention defines the magnetic moment of a coil, solenoid or
magnet, from the relation $m = T_m/B$ ampere metre², where $T_m$ is the maximum
torque, and emphasises the rôle of the magnetic flux density B.

On the other hand, the Kennelly convention defines the magnetic moment
of a coil from the relation $m = T_m/H$ weber metre, and represents a reluctance
to drop H as the ‘cause’ of the force on an amperian loop (or magnetic dipole).

The Kennelly convention accepts $T = mH \sin\theta$ newton metre, as the expres-
sion for the torque or couple on a magnet, but adopts $T = NIAB \sin\theta$ newton
metre, for the torque on a coil or solenoid in a magnetic field. Although the
magnetic properties of bar magnets and solenoids are identical, this convention 
makes the torque on a magnet proportional to the H field and the torque on a 
solenoid proportional to the B field. The separate treatments of the 
magnetic fields of solenoids and magnets do not fit together in the Kennelly 
jigsaw puzzle. 
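
The difference between the two definitions of magnetic moment may be seen by applying both to the same coil. The sketch below assumes a coil in free space, where $H = B/\mu_0$; the function names and the coil data are illustrative and do not come from the reports cited.

```python
import math

MU0 = 4 * math.pi * 1e-7  # permeability of free space, henry/metre


def coil_torque(turns, current, area, b, theta):
    """Torque T = N I A B sin(theta) in newton metre."""
    return turns * current * area * b * math.sin(theta)


def sommerfeld_moment(max_torque, b):
    """m = T_m / B, in ampere metre^2."""
    return max_torque / b


def kennelly_moment(max_torque, b):
    """m = T_m / H, in weber metre, taking H = B / mu0 (free space assumed)."""
    h = b / MU0
    return max_torque / h


# Illustrative coil: 100 turns, 2 A, area 10 cm^2, in a field of 0.05 Wb/m^2.
t_max = coil_torque(100, 2.0, 1e-3, 0.05, math.pi / 2)
print(sommerfeld_moment(t_max, 0.05))  # equals N I A
print(kennelly_moment(t_max, 0.05))    # equals mu0 N I A
```

The two moments differ only by the factor $\mu_0$, but the Sommerfeld value is simply NIA, fixed directly by the current and geometry of the coil.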

The Kennelly units are congenial to those who, for historical reasons, wish 
to consider magnetic fields of magnets on the basis of magnetic poles, while 
considering magnetic fields of solenoids in terms of circulating charges in the wires 
of the solenoid. 

In the Sommerfeld units ferromagnetism is a particular case of magnetism 
produced by moving charges. 

So we prefer to adopt the Sommerfeld convention in schools. The 
Sommerfeld convention is widely used in influential American textbooks. 


(See references.) 


5.2. Definition of Magnetic Induction or Flux Density 


Electric intensity E, is defined with reference to the force on a charge in a
constant and uniform electric field. In the same way magnetic induction B,
may be defined with reference to the force on a moving charge or current in a
constant and uniform magnetic field.

An alternative definition of B, often associated with the Kennelly m.k.s. 
system, employs the induced electric field produced by a changing magnetic 
field, briefly called electromagnetic induction. 

It seems unwise to introduce changing fields before a student is familiar with 
constant magnetic fields. The induction definition requires the beginner to 
accept the theory of a search coil fluxmeter, and to understand the physics of 
transient induced currents at an early stage. Measurement of B by the induction 
method involving the interaction of electric and magnetic fields should logically 
be introduced after the properties of constant fields have been fully discussed. 


6. AMPERIAN CURRENTS AND FLUXMETERS 


There is now strong experimental support for Ampére’s suggestion that all 
magnetic effects can be attributed to charges in motion. Magnetism in a piece 
of iron is regarded as due to small Amperian currents (which may be spinning 
electrons) in iron. This view eliminates the need for the concept of the magnetic
pole and makes Coulomb’s law for the force between magnetic poles unnecessary. 

Magnetic theory may thus be developed, from the beginning, wholly in 
terms of electric currents. The concept of Amperian currents is very important 
in the modern unified treatment of electricity. 

As the concept of pole-strength is no longer required some have 
recommended that magnetometers should be discarded. Though the pole-strength
of a magnet is not required, experimental methods for determining the
flux density of magnetic fields, and the flux density in the interior of magnet
and solenoid are necessary. The force-on-a-conductor method of measuring
B (flux-density meter) has been mentioned, and the search coil fluxmeter will 
be met later when changing magnetic fields are discussed. 

The instruments previously known as magnetometers appear as two of the 
most useful flux-density meters available in schools. The deflection meter 
may be used to measure the earth’s horizontal component $B_H$, and also to measure
B for other magnetic fields, once $B_H$ of the earth has been determined. The
oscillation meter measures the magnetic moment of a magnet, m, from which
the maximum flux density in the interior of a magnet, $B_m$, may be calculated.


7. M.K.S. UNITS WITH PUPILS IN SCHOOLS
7.1. Changing from C.G.S. to M.K.S. with Young Students 


In the four New Zealand universities c.g.s. units in electricity are not usually 
introduced until at least the third university year, when it becomes necessary 
for the student to be able to read classical physics literature. 

Teaching c.g.s. units to pupils for one or two years and then expecting them 
to change to m.k.s. units, is almost equivalent to teaching a child to learn to 
speak in French, and then at the age of three years, suddenly changing to speaking 
in English. The vocabulary, idioms and method of expression differ in important
respects which must confuse the child as it certainly confuses many teachers.
Introduction at a late stage of m.k.s. units involves a redefinition of some physical 
quantities, the wholesale revision of basic formulae and the logical deduction
of the subject on a new foundation, the absolute ampere.


It is true that both languages may be used to describe identical experiences 
and phenomena, but it is simpler for the average child to use one language 
from the earliest years. The bright child changes readily from c.g.s. to m.k.s. 
but the average and below average are handicapped for a considerable period. 

For these reasons we believe Gaussian c.g.s. units should not be taught in schools.


7.2. Thinking in M.K.S. Units 


No student can speak fluently in a foreign language until he has learned to 
think in that language. Teachers used to c.g.s. units tend to think in c.g.s. 
units and find m.k.s. units clumsy. It is essential to learn to think in terms of 
metres, joules and watts, coulombs and farads instead of in cms, dynes, ergs, 
e.s.u. of charge, potential, capacitance, and e.m.u. of current, potential and so on. 
Think 1 micron = $10^{-6}$ metre, rather than $10^{-4}$ cm; 1 mm = $10^{-3}$ metre, rather
than $10^{-1}$ cm; 1 dyne = $10^{-5}$ newton, rather than 1 newton = $10^{5}$ dynes; and
1 erg = $10^{-7}$ joule, rather than 1 joule = $10^{7}$ ergs.
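
The habit of thinking towards m.k.s. units, rather than away from them, can be captured in a small conversion table. The sketch below is illustrative only; the dictionary and function names are assumptions, while the factors are the standard ones quoted above.

```python
import math

# Factor by which a quantity in the named unit is multiplied to give m.k.s. units.
TO_MKS = {
    "micron": 1e-6,  # metre
    "mm": 1e-3,      # metre
    "cm": 1e-2,      # metre
    "dyne": 1e-5,    # newton
    "erg": 1e-7,     # joule
}


def to_mks(value, unit):
    """Convert value from the named c.g.s. or sub-multiple unit to m.k.s."""
    return value * TO_MKS[unit]


print(to_mks(1.0, "dyne"))   # one dyne expressed in newtons
print(to_mks(250.0, "erg"))  # 250 ergs expressed in joules
```

Only the one direction of conversion is provided, in keeping with the advice to treat the m.k.s. unit as the natural one.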

Rough everyday equivalents for many of the commonest m.k.s. units are most
helpful in giving young pupils an appreciation of the magnitude of unfamiliar 
m.k.s. units. 

A metre is half the height of a door. 

A kilogram is a mass equal to that of about two pounds of butter. 

A second is the time required to say a five syllable word, e.g. electricity. 

A newton is the force equal to the weight of 102 grams or 3.6 oz. wt.


One joule is the energy lost by a mass of 0.1 kg rolling off the table and
falling on your toe. One joule of heat energy is released by a small drop of 
boiling water cooling on your skin. 

The watt is approximately the power developed when using a rotary pencil 
sharpener. 

An ampere is the current flowing in the 6 v 6 w bulb used in the rear tail 
lights of many cars. 

One coulomb of electricity is the quantity required to deposit 1/3000 gram 
of copper. One coulomb is also about 1/50 of the moving charge in a lightning 
stroke. 
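
Several of these everyday equivalents can be checked by simple arithmetic, which pupils may enjoy doing for themselves. The sketch below does so under stated assumptions: g is taken as 9.81 m/s², the table is assumed to be 1 metre high, and copper is assumed to be deposited from a divalent salt (molar mass 63.55 g, Faraday constant 96485 coulomb per mole). The variable names are illustrative.

```python
G = 9.81           # acceleration due to gravity, metre/second^2
FARADAY = 96485.0  # coulomb per mole of electrons
CU_MOLAR_MASS = 63.55  # gram per mole
CU_VALENCE = 2     # assumption: deposition from a cupric (divalent) salt

# Mass whose weight is one newton, in kilograms (text: 102 grams).
mass_for_one_newton = 1.0 / G

# Energy lost by 0.1 kg falling from a table assumed to be 1 m high, in joules.
energy_of_fall = 0.1 * G * 1.0

# Current in a 6 volt, 6 watt bulb, in amperes: I = P / V.
bulb_current = 6.0 / 6.0

# Mass of copper deposited by one coulomb, in grams (text: 1/3000 gram).
copper_per_coulomb = CU_MOLAR_MASS / (CU_VALENCE * FARADAY)

print(mass_for_one_newton, energy_of_fall, bulb_current, 1.0 / copper_per_coulomb)
```

Each result agrees with the rough equivalent quoted to within a few per cent, which is all that such memory aids require.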


7.3. How NOT to proceed 
It is necessary to voice a vigorous protest against the use of c.g.s. unrationa- 
lized formulae and numerical constants, instead of the rationalized m.k.s.a. 
formulae, by teachers who are changing from the traditional to the modern 
approach. Examples: 


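Rationalized m.k.s.a.                                              Unrationalized c.g.s. with constant

$F = \frac{1}{\epsilon_0}\,\frac{Q_1 Q_2}{4\pi r^2}$ newtons                 $F = k_e\,\frac{Q_1 Q_2}{r^2}$ newtons

$F = \mu_0\,\frac{I_1 I_2 l}{2\pi r}$ newtons                          $F = k_m\,\frac{2 I_1 I_2 l}{r}$ newtons

$B = \mu_0\,\frac{NI}{l}$ weber/sq.m.                            $B = k_m\,\frac{4\pi NI}{l}$ weber/sq.m.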
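
That the two styles of formula give the same numbers, and differ only in where the factors of $\pi$ are hidden, can be confirmed by direct calculation. The sketch below is a check under assumptions: the constants are written here as k_e and k_m (hypothetical names) and are given the values $1/4\pi\epsilon_0$ and $\mu_0/4\pi$ needed to make the unrationalized forms agree with the rationalized ones.

```python
import math

EPS0 = 8.854e-12          # permittivity of free space, farad/metre
MU0 = 4 * math.pi * 1e-7  # permeability of free space, henry/metre

K_E = 1 / (4 * math.pi * EPS0)  # assumed value of the electric constant k_e
K_M = MU0 / (4 * math.pi)       # assumed value of the magnetic constant k_m


def coulomb_rationalized(q1, q2, r):
    return q1 * q2 / (EPS0 * 4 * math.pi * r**2)


def coulomb_with_constant(q1, q2, r):
    return K_E * q1 * q2 / r**2


def wires_rationalized(i1, i2, length, r):
    return MU0 * i1 * i2 * length / (2 * math.pi * r)


def wires_with_constant(i1, i2, length, r):
    return K_M * 2 * i1 * i2 * length / r


print(coulomb_rationalized(1e-6, 1e-6, 0.1), coulomb_with_constant(1e-6, 1e-6, 0.1))
print(wires_rationalized(5, 5, 1.0, 0.01), wires_with_constant(5, 5, 1.0, 0.01))
```

The agreement is exact; the objection to the second column is therefore one of teaching, not of arithmetic.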


The author studied electricity based on c.g.s. units thirty years ago, and when 
the advantages of m.k.s. units were seen, preferred to retain c.g.s. formulae,
if at all possible. The above c.g.s. formula compromise was tried in an elementary
class, but later caused confusion with weaker pupils when they met rationalized
formulae in the Sears and Zemansky textbook. It is to the advantage of
teachers and pupils alike to abandon the c.g.s. unrationalized formulae, and 
gain confidence in the rationalized m.k.s.a. formulae. 

For the modern approach is not just a matter of altering constants in formulae 
so that one set of units can be used, or simplifying the teaching of a unique 
logical treatment. There is no unique treatment.

The new approach considers electric and magnetic fields, and current electri- 
city as properties of the electric charge; and chooses as fundamental those laws from 
which other laws may be deduced most readily. 


Part II. Notes on Teaching Sequence 


1. CONSTANT FIELDS 
1.1. Electric Field E of Charges in Free Space 


1. Force between Charged Bodies. Experimental demonstration of force 
between charged bodies. Law of force between static charges. Rutherford 
atom and electric charge, ‘ electric substance’ carried to electroscope by proof 
plane. 

Force on a charge $Q_1$ in the spherical field of another charge $Q_2$. Coulomb’s
Law.

$F = \frac{1}{\epsilon_0}\,\frac{Q_1 Q_2}{4\pi r^2}$ newton, where $\frac{1}{\epsilon_0} = 36\pi \times 10^{9}$

$\;\;= \frac{1}{\epsilon_0}\,\frac{Q_1 Q_2}{\text{area of sphere of radius } r \text{ around } Q_2}$.

Units of $F$, $Q_1$, $Q_2$ and $r$ previously specified, and value of electric constant $1/\epsilon_0$
found by experiment to be $36\pi \times 10^{9}$ m.k.s.a. units.


2. Electric Fields and Lines of Force. Demonstration of electric field lines 
by grass-seed (crested dogstail) on carbon tetrachloride covered with layer of 
kerosene and field of 1000 v/cm from neon-lamp transformer (safeguards). 
Pivoted ebonite strip, one end rubbed with flannelette and the other with poly- 
thene film, acts as an electric compass with constant fields from Wimshurst 
machine. Recently powdered gypsum, and cork filings also used to plot 
electric fields. Plot electric fields of point, line and plane charges. 

Define electric intensity of an electric field by relation, 


E=F/Q newton/coulomb, thus F= EQ. 


3. Electric Field around a Point Charge. From Coulomb’s Law and definition
of electric intensity E, deduce magnitude of spherical field around a point
charge,

$E = \frac{1}{\epsilon_0}\,\frac{Q}{4\pi r^2} = \frac{1}{\epsilon_0}\,\frac{Q}{\text{area of sphere of radius } r \text{ around } Q}$ newton/coulomb.


Quantitative representation of field E by lines of force. Define unit line.
Magnitude of field E, represented by lines of force density in lines/metre².


4. Application of last formula in differential form (not required in schools). 


5. Gauss’ Law as an application of Coulomb’s Law. Field of point charge 
deduced from Coulomb’s law and definition of E. Gauss’ Law is a generaliza- 
tion from relation for field of point charge. 


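Gauss’ Law, $N = EA = \frac{1}{\epsilon_0}\,Q$, thus $E = \frac{1}{\epsilon_0}\,\frac{Q}{A}$. Hence $\frac{1}{\epsilon_0}$ or $36\pi \times 10^{9}$
lines of force emanate from each coulomb of positive charge.

Applied to deduce field round charged wire

$E = \frac{1}{\epsilon_0}\,\frac{Q}{2\pi r l} = \frac{1}{\epsilon_0}\,\frac{Q}{\text{area of cylinder of radius } r \text{ around wire}}$ newton/coulomb.

Applied to field between charged plates

$E = \frac{1}{\epsilon_0}\,\frac{Q}{A}$ newton/coulomb.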

€9 A 

6. Potential Energy W and Electric Potential V. Hence V=W/Q joules/ 
coulomb or volts. 

Relation between electric field and potential, E= V/S volt/metre or newton/ 
coulomb. E also measured by lines of force density in lines/metre². (This
parallels magnetic flux density B in weber/metre².)


Gold-leaf electroscope as an electrostatic voltmeter or electrometer. 


7. Capacitors and Capacitance. Relation defining capacitance C=Q/V 
farad. Size of coulomb and farad. Sub-multiple prefixes. Capacitors in 
parallel and series. Capacitance of sphere and parallel plate capacitor. 

Demonstration of movement of charge in charging and discharging capacitor 
through high resistance, using microammeter and electrostatic voltmeter. 
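
The relations in this sequence lend themselves to a few numerical checks that a class can follow. The sketch below takes $1/\epsilon_0 = 36\pi \times 10^{9}$ as quoted in item 1 (an approximation to the measured value) and uses the usual free-space results $C = 4\pi\epsilon_0 r$ for a sphere and $C = \epsilon_0 A/d$ for parallel plates; the function names and sample values are illustrative assumptions.

```python
import math

INV_EPS0 = 36 * math.pi * 1e9  # value of 1/eps0 quoted in the notes
EPS0 = 1 / INV_EPS0


def coulomb_force(q1, q2, r):
    """F = (1/eps0) Q1 Q2 / (4 pi r^2) in newtons."""
    return INV_EPS0 * q1 * q2 / (4 * math.pi * r**2)


def point_charge_field(q, r):
    """E = (1/eps0) Q / (4 pi r^2) in newton/coulomb."""
    return INV_EPS0 * q / (4 * math.pi * r**2)


def sphere_capacitance(r):
    """Capacitance of an isolated sphere of radius r metres, in farads."""
    return 4 * math.pi * EPS0 * r


def plate_capacitance(area, separation):
    """Capacitance of a parallel plate capacitor in free space, in farads."""
    return EPS0 * area / separation


print(coulomb_force(1e-6, 1e-6, 1.0))  # two microcoulomb charges 1 m apart
print(sphere_capacitance(0.09))        # sphere of radius 9 cm
print(plate_capacitance(0.01, 0.001))  # plates 10 cm square, 1 mm apart
```

On these figures a sphere would need a radius of $9 \times 10^{9}$ metres to have a capacitance of one farad, which gives some idea of the size of the unit and of the need for sub-multiple prefixes.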


1.2. Electric Fields of Charges in Polarizable or Dielectric Materials 


1. Effect of material between charged bodies on force between charges by
experiment. A uniform dielectric filling a region where the field is strong.

2. Modified Coulomb’s law for dielectric media involving relative
permittivity $\epsilon_r$.

$F = \frac{1}{\epsilon_0 \epsilon_r}\,\frac{Q_1 Q_2}{4\pi r^2}$ newton for point charges.

3. Modified relation for electric field

$E = \frac{1}{\epsilon_0 \epsilon_r}\,\frac{Q}{4\pi r^2}$ newton/coulomb for point charges.

4. Modified Gauss’ Law. $N = EA = \frac{1}{\epsilon_0 \epsilon_r}\,Q$ lines.

5. Modified Potential relation $V = \frac{1}{\epsilon_0 \epsilon_r}\,\frac{Q}{4\pi r}$ volt.

6. Modified relation for capacitance of sphere and parallel plate capacitor

$C = \epsilon_0 \epsilon_r \cdot 4\pi r$ farad, and $C = \epsilon_0 \epsilon_r \cdot A/d$ farad.
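
The effect of the dielectric on each relation is simply to replace $\epsilon_0$ by $\epsilon_0 \epsilon_r$, and this can be shown with one extra parameter. The sketch below is illustrative: the function names are assumptions, and the relative permittivity of 2.3 used in the example is merely a representative figure for a plastic film, not a value taken from the notes.

```python
import math

EPS0 = 8.854e-12  # permittivity of free space, farad/metre


def coulomb_force(q1, q2, r, eps_r=1.0):
    """Force between point charges in a uniform medium, in newtons."""
    return q1 * q2 / (EPS0 * eps_r * 4 * math.pi * r**2)


def plate_capacitance(area, separation, eps_r=1.0):
    """C = eps0 * eps_r * A / d, in farads."""
    return EPS0 * eps_r * area / separation


eps_r = 2.3  # assumed illustrative value
print(coulomb_force(1e-6, 1e-6, 0.1) / coulomb_force(1e-6, 1e-6, 0.1, eps_r))
print(plate_capacitance(0.01, 0.001, eps_r) / plate_capacitance(0.01, 0.001))
```

Both ratios equal $\epsilon_r$: the force is reduced, and the capacitance increased, in the same proportion.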




1.3. Magnetic Field B of Moving Charges in Free Space 


1. Force between Currents. Measurement of Current:

(a) Demonstrate force between parallel currents. Law of parallel currents,
like currents attract; and contrast with law of charges.

(b) Current as a flow of something, carries charge observed in electrostatics,
e.g. sparks, proof plane charging electroscope by spoonful, charging
capacitor, Millikan oil-drop experiment.

(c) Flow measured as time-rate of transport, measured by quantity per
unit time. Thus current = charge/time. $I = Q/t$ coulomb/sec. or
ampere.

(d) Electron drift in metals. Identity of moving charge and current element.
Need to measure current, as there is no electrostatic method of measuring
charge of the order of coulombs. Accurate measurement of moving
charge or current by force between wires.

(e) Force between charges, $F \propto Q_1 Q_2$; thus we should expect force between
currents $F \propto I_1 I_2$.
By experiment, Ampère found force between parallel currents of unit
length, $F/l \propto I_1 I_2 / r$. If we assign the constant of proportionality the value
$\mu_0 / 2\pi$, then

$\frac{F}{l} = \mu_0\,\frac{I_1 I_2}{2\pi r}$ newton/metre.

The force on one current is inversely proportional to the length of the
circular flux lines around the other current, thus we put the factor $2\pi r$
in denominator.

Give at this stage the new definition of m.k.s.a. absolute ampere, which,
in effect, specifies the value of $\mu_0$.


(f) Current balance.
Schools should have a suitable demonstration current balance. The
parallel-loop current balance requires two single-turn coils of S.W.G.
18 insulated copper wire, of diameter 20 cm, with ends twisted together.
One coil lies on the baseboard of the balance and the other coil is suspended
from the beam by linen threads. Six-centimetre lengths of 15 amp fuse
wire are soldered to the ends of the suspended loop and bent to dip in
mercury cup contacts. The suspended loop is adjusted parallel to the fixed
loop and at a distance of 1-2 mm from it. Currents of 5-15 amps are
used. Twin multiple-coil balances are more sensitive but less simple
in theory.

(g) Ampère's current element force law for parallel current elements:

$$F = \mu_0 \frac{I_1\,dl_1\,I_2\,dl_2}{4\pi r^2} \text{ newton.}$$

This law is the basis of all magnetic theory. Although not directly
demonstrable, the law is accepted as a basic theorem, because laws
deduced from it can be proved by experiment.
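As a rough numerical check on the law of parallel currents in this section, the following Python sketch evaluates $F/l = \mu_0 I_1 I_2 / 2\pi r$. It assumes $\mu_0 = 4\pi \times 10^{-7}$ newton/amp², the value implied by the m.k.s.a. definition of the ampere; the function name and the sample currents and spacing are illustrative choices and are not taken from the syllabus.

```python
import math

# Assumed value of mu_0 (N/A^2), as implied by the m.k.s.a. definition of the ampere.
MU_0 = 4 * math.pi * 1e-7


def force_per_metre(i1, i2, r):
    """Force per unit length (N/m) between two long parallel currents.

    i1, i2 are the currents in amperes; r is their separation in metres.
    """
    return MU_0 * i1 * i2 / (2 * math.pi * r)


# Defining case of the ampere: 1 A in each wire, wires 1 m apart.
print(force_per_metre(1.0, 1.0, 1.0))

# Illustrative (hypothetical) bench values: 10 A in each conductor, 2 mm apart.
print(force_per_metre(10.0, 10.0, 0.002))
```

The first case gives about $2 \times 10^{-7}$ newton/metre, which is the figure used in the definition of the ampere; the second gives about 0.01 newton/metre, which suggests why close spacing and currents of several amps are wanted in a demonstration balance.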


2. Magnetic Fields of Currents and Magnets, that is, Temporary and Permanent 
Magnets. (Qualitative.) 


(a) Solenoids and bar magnets with magnetic materials. Suitable solenoids
formed by winding one pound of 18 or 20 gauge insulated copper wire
on a cardboard tube of length 15 cm and diameter 25 cm. Use a
12 v source, and 1-ohm 15-amp rheostat, if desired, to limit current and
reduce rate of heating. Show the attraction of tacks, nails, iron filings
etc., and the difference between magnetic and non-magnetic materials.

Electromagnetism for Schools 363

(b) Solenoid and bar magnets as compasses. Law of magnetic poles.
(Solenoid compass may be of the suspended or pivot type. To construct
a suspension compass wind 80 metres of 20 or 22 gauge insulated
copper wire on a cardboard tube 20 cm long and 1-2 cm diameter.
The solenoid is suspended by fine wire or phosphor bronze strip, and
the two ends brought out, one at mid point of lower surface, and the
other at the side, to dip in mercury in a central trough and a surrounding
annular or ring trough. The troughs are cut with a wood lathe and
connected by wires to terminals. A current of 4 amps from a 12 v
battery gives a magnetizing field of about 2000 amp turns, which is
sufficient to cause a slow swing round till the axis lies in the magnetic
meridian.)

(c) Forces between suspended solenoids and magnets. Apply law of
parallel currents and law of magnetic poles.

(d) Solenoid-like properties of a bar magnet suggest electron-currents in
magnet. Mention nucleus surrounded by circulating electrons in iron
atoms.

(e) Meaning of term magnetic field and earth's magnetic field.

Plotting magnetic fields of solenoid and bar magnet with iron filings,
compass and small suspended plotting solenoid. Parallel currents
enable prediction of alignment of plotting solenoid. Rules for direction
of flux lines: right hand grasping rule, and facing the clockwise current
rule. (The large solenoid is the one constructed for use in testing
attraction of magnetic materials. The small plotting solenoid consists
of 50 turns of 26 or 28 gauge insulated copper wire wound on a cardboard
tube 3 cm long and 1 cm in diameter. The two free ends from centre
of solenoid are wound around a piece of 10 gauge wire to form a spiral
from which plotting solenoid is suspended. End in which current is
flowing anti-clockwise painted red to indicate N seeking pole.)
Emphasise the identity of the magnetic fields of solenoid and bar magnet. 


(f) Mention Rowland’s experiment on the magnetic fields of moving charges. 
(g) Magnetic field of ring solenoid and ring magnet. 


(h) Magnetic field of coil and ‘magnetic shell’ magnet. Plot the field with
iron filings on a card through axial plane of coil, also with compass and
plotting solenoid.

Demonstrate the force on flexible straight conducting wires held on a
bow, in various parts of the field of the coil. Then give the force-on-current
definition of flux lines. (A large coil is required, constructed
by winding one pound of 20 gauge insulated copper wire on a wood
former, 15 cm in diameter and 2.5 cm thick. Use a 12 v current
supply.)

The holder with loose flexible wire, like a violin bow with loose strands,
requires 4 strands of SWG 26 or 6 strands of SWG 28 wire. Two
48 w or three 36 w lamps connected in parallel, in the circuit of the
flexible wire, give momentary heavy current.


364 E. J. Hay

(i) Ampère's statement on magnetic fields of currents and magnets. Any
permanent magnet develops a flux distribution similar in shape to that
of the electric circuit in a coil wound around the same boundary as that
of the magnet.

(j) Field around straight current, using filings and compass. Oersted's
experiment. Cylindrical field with circular flux lines. Corkscrew
rule.

Field around a current element. Use one metre of 20 gauge wire folded
at the centre point to give parallel wires 3 mm apart. The short 3 mm
section at the bend may represent a current element. Show field of
spherical symmetry. The closed loop characteristic of magnetic lines
of flux should be contrasted with electric lines of force which end on
charges.

Pupils must feel at home with magnetic field flux lines before any idea
of magnitude of field is introduced.


3. Force on a moving charge or Current in a Conductor. Measurement of
Magnetic Fields. It is convenient to separate the effect of one moving charge
or current upon another into two parts:

(i) One moving charge is said to produce a magnetic field.
(ii) This magnetic field produces a force on a second moving charge.

It is preferred to consider the second aspect first.

(a) Demonstrate the force on conductor with a wire in the field between
poles of strong horseshoe ticonal or hycomax magnet, or electromagnet.
Left hand motor rule. Varying strength of field seen with the force
on the flexible wires in the field of the coil.

(b) Measurement of magnetic fields. Force-on-Conductor Balance or
Fluxmeter.
Balance shows $F \propto Il$, thus $F = \text{const.} \times Il$. The constant is a measure of
flux density of field and given symbol $B$. Then $F = BIl$ newton, and
$B = F/Il$ tesla. Unit of $B$ is newton/amp metre, which, by international
agreement in 1956, has been called the tesla. As flux is measured in
webers, flux density is also measured in weber/metre².
(A force on conductor balance may be constructed using a U-shaped
ticonal, alcomax or hycomax magnet of moderate size, and a rectangular
coil of 20 turns, height 3 in. and width 24in. The coil is fixed on a
# in. thick wood base and connected to terminals. The magnet is
attached by tape and drawing pins to a short wooden rod which is
suspended from the beam of a balance. Current is passed in a direction
to give repulsion.)

(c) Flux and flux density, $\Phi = BA$; unit, the weber. The weber is a large
unit. It is convenient to retain the e.m.u. flux line as a sub-unit of
the weber, when 1 weber = $10^8$ flux lines.

(d) Force on current element

$dF = I\,dl\,B \sin\theta$ newton.

(e) Force on a moving charge. Lorentz Force Law
$F = QvB$ newton ($v$ and $F \perp B$).

(f) Lorentz force law $F = EQ + QvB$ for charges moving in parallel paths
($v$ and $F \perp B$).




(g) Torque on a coil, $T = NIAB \sin\theta$ newton metre.

(h) Magnetic moment of coil and solenoid, $m = NIA$ amp m². Since
$T_{max} = mB$ newton metre, flux density $B$ may be defined, $B = T_{max}/m$
weber/sq. metre for coil.
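The two definitions of flux density in this section, $B = F/Il$ for a straight conductor and $B = T_{max}/m$ for a coil, can be compared numerically. The Python sketch below is only an illustration: the function names and every reading in it (force, current, length, turns, area) are hypothetical and are not measurements quoted in the syllabus.

```python
import math


def flux_density_from_force(force, current, length):
    """B = F/(I l) in tesla, for a straight conductor at right angles to the field."""
    return force / (current * length)


def force_on_charge(charge, speed, b):
    """F = Q v B in newtons, for a velocity at right angles to the field."""
    return charge * speed * b


def coil_torque(turns, current, area, b, theta_deg):
    """T = N I A B sin(theta) in newton metres; theta is given in degrees."""
    return turns * current * area * b * math.sin(math.radians(theta_deg))


# Hypothetical balance reading: 0.012 N on 0.05 m of wire carrying 2 A.
b = flux_density_from_force(0.012, 2.0, 0.05)

# Hypothetical coil in the same field: 20 turns, 2 A, area 0.004 m^2.
m = 20 * 2.0 * 0.004                      # magnetic moment m = N I A (amp m^2)
t_max = coil_torque(20, 2.0, 0.004, b, 90.0)

print(b)            # flux density from the force definition
print(t_max / m)    # flux density from the torque definition
```

Both printed values agree (about 0.12 tesla for these invented readings), which is the point of item (h): the torque definition and the force definition describe the same quantity.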


1.4. Magnetic Flux Density of Fields due to Currents 


Experimentally, the flux density of a field is measured by a flux-density meter. 

Four types are available in schools: 

(a) Force-on-conductor fluxmeter. 

(b) Deflection or tangent fluxmeter.

(c) Oscillation fluxmeter. 

(d) Search coil fluxmeter. 
Types (b) and (c) are mentioned after properties of magnetic materials.
Type (d) involves the inductive measurement of flux and the
properties of non-constant flux.


1.5. Magnetic Field B of Moving Charges in Polarizable or Magnetic Materials. 
Amperian Currents 


(a) Methods of magnetizing and demagnetizing. Induced magnetism. 

(b) Magnetic properties of steel and soft iron. Electromagnet. 

(c) Amperian currents approach. Start with observed forces between 
parallel currents rather than Coulomb’s law for poles. 
Flux density is defined, 


$B = F/Il$ weber/sq. metre, for straight wire
$B = T_{max}/m$ weber/sq. metre, for coil.


Iron acts as if its atoms contained circuits behaving like coils carrying 
current. In a magnetic field these atom electron circuits behave like 
minute plotting solenoids, and align themselves with their axes in the 
direction of the external field. The field of the iron is referred to the 
nett uncancelled amperian currents on its lateral surface. The iron
in an electromagnet is practically equivalent to an amperian current 
running alongside the conduction current. A permanent bar magnet 
with its amperian circuits aligned is equivalent to a solenoid. 

A decision to introduce magnetism by the Amperian current concept
leads almost inevitably to an emphasis on $B$, to the correlation of $B$ with
$E$ (Electric intensity), and to preference for the Sommerfeld definition of
magnetic moment ($m = T/B$).

(d) Ferromagnetism and electron structure of the iron atom. 
(e) Theory of magnetized materials based on Amperian currents. 
(f) Fluxmeters employing the torque on a suspended magnet.

(i) Deflection or Tangent Flux-density meter and Theory. Used to
measure $B_H$ of earth's horizontal field, and $B$ near straight wire,
coil, solenoid or magnet.

(ii) Oscillation Flux-density meter. 

Used to compare flux density of magnetic fields, and to measure magnetic
moment of a bar magnet and thus calculate maximum internal flux
density.
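The theory of the deflection or tangent flux-density meter named in (i) is not written out in the outline. Assuming the usual tangent law, in which a field $B$ at right angles to the earth's horizontal field $B_H$ turns a compass needle through an angle $\theta$ with $\tan\theta = B/B_H$, a reading could be reduced as in the following Python sketch. The function name and the value taken for $B_H$ are illustrative assumptions.

```python
import math


def field_from_deflection(b_horizontal, deflection_deg):
    """Tangent law (assumed): B = B_H * tan(theta).

    b_horizontal is the earth's horizontal flux density in tesla;
    deflection_deg is the needle's deflection from the meridian in degrees.
    The field being measured is taken to be at right angles to the meridian.
    """
    return b_horizontal * math.tan(math.radians(deflection_deg))


b_h = 1.8e-5  # tesla; an illustrative figure, not a value from the syllabus
for angle in (10.0, 30.0, 45.0, 60.0):
    print(angle, field_from_deflection(b_h, angle))
```

At a deflection of 45 degrees the field under test equals $B_H$, so the instrument is most conveniently used for fields comparable with the earth's horizontal field.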




Note: Magnetic fields of moving charges with magnetic materials will
require introduction of relative permeability $\mu_r$.
The H field will be required only in the topic hysteresis.


2. CHANGING OR VARYING MAGNETIC FIELDS 
2.1. Electromagnetic Induction based on Faraday’s Law 
Changing magnetic field in a circuit induces an electric field and an e.m.f. 


1. Faraday’s experiments. 

2. Motional or dynamically induced e.m.f. with flux cutting. Relative
motion of circuit and field. May explain effects in terms of Ampère's magnetic
force and relation $F = IlB$ newton.

Motional e.m.f. $V = Blv$ volt.
Right-hand dynamo rule.

3. Statically induced e.m.f. with flux linkage. Circuit stationary and
magnetic field varying in density. Not directly related to $F = IlB$.
Faraday relation, $V \propto dB/dt$ ($N$ and $A$ constant).


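Faraday-Neumann Law: $V = -N\,d\Phi/dt$ volt.
But $\Phi = BA$, thus

$$V = -N\frac{d(BA)}{dt} = -NB\frac{dA}{dt} \quad \text{(motional e.m.f.)}$$

$$\text{or} \quad V = -NA\frac{dB}{dt} \quad \text{(statical e.m.f.)}$$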


4. Induced current and charge.

5. Lenz’s law as an application of Le Chatelier’s law. 

6. Faraday disc generator. Faraday disc in slightly different category.
The circuit has a moving arm but a fixed area. This involves accelerated
rotational motion of a conductor, and

$V = -d\Phi/dt$ volt

is still valid, provided $d\Phi/dt$ is interpreted as the time rate of cutting flux.

7. Lorenz absolute ohm experiment. 

8. Search coil fluxmeter. 

9. Galvanometer and damping. 

10. Transformers and eddy currents. 

11. Self induction and mutual induction. 
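The distinction drawn in items 2 and 3 above between motional and statical e.m.f. can be illustrated with a short Python sketch. It uses $V = Blv$ for a conductor cutting flux and an average form of the Faraday-Neumann law, $V = -N\,\Delta\Phi/\Delta t$, for a change of flux linkage. The function names and all the numerical values are hypothetical and serve only as an example.

```python
def motional_emf(b, length, speed):
    """V = B l v (volts) for a straight conductor cutting flux at right angles."""
    return b * length * speed


def induced_emf(turns, flux_before, flux_after, dt):
    """Average V = -N * (flux_after - flux_before) / dt, with flux in webers and dt in seconds."""
    return -turns * (flux_after - flux_before) / dt


# Motional case: a rod 0.1 m long moving at 2 m/s across a field of 0.5 T.
v_motional = motional_emf(0.5, 0.1, 2.0)

# The same rod treated as one side of a circuit whose area grows by l*v*dt in time dt.
dt = 1e-3
area_gain = 0.1 * 2.0 * dt
v_linkage = induced_emf(1, 0.0, 0.5 * area_gain, dt)

# Statical case: fixed coil of 100 turns and area 0.01 m^2, B rising from 0 to 0.2 T in 0.1 s.
v_statical = induced_emf(100, 0.0, 0.2 * 0.01, 0.1)

print(v_motional, v_linkage, v_statical)
```

For these figures the flux-cutting rule and the flux-linkage rule give the same magnitude (about 0.1 volt) for the moving rod, the sign in the second case expressing Lenz's law, while the stationary coil gives about 2 volts from the changing field alone.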


REFERENCES 


General: 

Coulomb's Law Committee of the American Association of Physics Teachers, 1950,
‘The Teaching of Electricity and Magnetism at the College Level’, Am. Jour. Phys.
18, 1-25, 69-88.


School and Junior University:

Holton, G., and Roller, D. H. D., 1958, Foundations of Modern Physical Science, p. 782
(Reading, U.S.A.: Addison-Wesley).

Sears, F. W., and Zemansky, M. W., 1952, College Physics Pt. II, Electricity, Magnetism
and Optics, p. 499 (Reading, U.S.A.: Addison-Wesley) (No calculus). Or

Richards, Sears, Wehr and Zemansky, 1960, Modern University Physics (Reading,
U.S.A.: Addison-Wesley) (Calculus).

Resnick, R., and Halliday, D., 1960, Physics (London: Wiley).

Thompson, M. A., 1960, MKS Units in Schools, p. 41 (Auckland University, N.Z.) (Local
circulation).

Pohl, R. W., 1930, Physical Principles of Electricity and Magnetism, p. 356 (London:
Blackie & Son).


Intermediate University: 


Bitter, F. H., 1956, Currents, Fields and Particles, p. 598 (London: Chapman & Hall Ltd.).
Peck, E. R., 1953, Electricity and Magnetism, p. 476 (New York: McGraw Hill).
Constant, F. W., 1958, Theoretical Physics: Electromagnetism, p. 364 (Reading, U.S.A.:
Addison-Wesley). This book gives the clearest outline of the structure of electricity
known to the author.


Advanced University: 


Kraus, J. D., 1953, Electromagnetics, p. 604 (New York: McGraw Hill).
Smythe, W. R., 1950, Static and Dynamic Electricity, p. 597 (New York: McGraw Hill).
Panofsky, W. K. H., and Phillips, M., 1955, Classical Electricity and Magnetism, p. 400
(Reading, U.S.A.: Addison-Wesley).
Stratton, J. A., 1941, Electromagnetic Theory, p. 615 (New York: McGraw Hill Co.).
Sommerfeld, A., 1952, Electrodynamics, Lectures on Theoretical Physics, Vol. III (U.S.A.:
Academic Press).

Most of the above are listed as standard references by one or more of the four New
Zealand universities.


The Physical Science Study Committee 
(4) : The Working Session on Physics Teaching at the 
Cavendish Laboratory, Cambridge 


by URI HABER-SCHAIM 


Educational Services Incorporated, Watertown, Massachusetts, U.S.A.


The Working Session on Physics Teaching in Collaboration with the 
Physical Science Study Committee was organized by the Directorate for 
Scientific and Technical Personnel of the Organization for European Economic 
Cooperation (now the Organization for Economic Cooperation and Develop- 
ment). It took place at the Cavendish Laboratory, University of Cambridge, 
in July-August 1961. A group of thirty physics teachers met for four full weeks 
and actually spent between 8 and 12 hours a day, 5 to 6 days a week, in studying 
a new physics course in detail. 


1. THE BACKGROUND 


Knowledge of the existence of the physics course developed in the United 
States, with all its learning aids—text, laboratory guide and apparatus, films, 
and tests—began to diffuse around the world three or four years ago. At this 
time the course was still being developed and tested. Numerous requests for 
information were received by the Physical Science Study Committee; many 
visitors from abroad called at its offices to get first-hand information. Such 
visits, as well as short articles published in educational journals, merely gave 
people the feeling that a more thorough study of the materials was necessary to 
obtain a clear picture of the ideas and merits of the new course. 

In the summer of 1959 several regional organizations, in co-operation with 
the National Science Foundation, supported the participation of physics teachers 
from various countries in summer courses in the United States. These summer 
courses, of 6 to 8 weeks’ duration, are designed to train American teachers to 
give the PSSC course; they go very thoroughly through the material, elaborating 
on the philosophy of the course. Five participants from Europe were sponsored 
by the National Science Foundation and the Organization for European Economic 
Cooperation. These European participants were very experienced in their
field and, although they enjoyed living in the United States for the summer and 
meeting American teachers and their families, they felt that the American 
teacher-training institutes were not the most efficient way for them to gain 
knowledge of the PSSC course. Nevertheless, as a result of their participation 
in the summer courses in the United States, interest in the PSSC course con- 
tinued to rise rapidly, and in the summer of 1960 another group of European 
teachers participated in American summer institutes, under the sponsorship of 
the Organization for European Economic Cooperation and the National Science 
Foundation. 

It should be noted at this point that the participants from other countries 
were dispersed among several institutes and, since in no case were there more 
than one of these participants in any single institute, there was no opportunity 
for exchange of ideas on how this course, or parts of it, could be used in their
own countries. Visualizing how a course created for secondary schools in one
country might be taught in the secondary schools of another requires con- 
siderable study. Although physics itself is universal, the background of the 
students, traditions of teaching, and preparation in related fields (such as 
chemistry and mathematics) differ greatly from country to country, as do also 
the number of hours spent per week and the total hours per year. Needless to 
say, the transportation of large numbers of teachers from Europe to the United 
States for the summer courses was a rather expensive proposition. If this had 
been the only way of familiarizing European physics teachers and supervisors 
with the PSSC course, the number of participants would necessarily have 
remained small, and the burden of evaluating the course for home consumption 
would have remained on the shoulders of the few teachers who had studied the 
course in the American summer institutes. 

In the autumn of 1960 the suggestion was made to the Division of Scientific 
and Technical Personnel of the Organization for European Economic Co- 
operation, that instead of bringing a few teachers to the United States for the 
summer of 1961, a few American staff members should be sent to Europe to 
conduct an institute specifically designed for the needs of European teachers.
This suggestion was accepted by OEEC and thus the way was paved for the 
Cambridge Session. 

It is well known that the European school system is very different from the 
American school system, but a closer look reveals that there are also large 
differences among the various European school systems. To add to the
complications of organizing the Working Session, not all of the countries represented
at the Cambridge Session had the same interest in the course. Some countries
intended to try the course out on a limited scale, in so-called pilot schools, in
order to get a valid opinion of its usefulness in their own circumstances. Other
countries merely wanted to be represented at the Session without planning any
‘follow-up’ action at the time. OEEC decided that the countries in the first
category would be allowed to send a delegation of 2 to 4 persons, whereas 
countries in the second would send one participant only. 


2. THE ORGANIZATION OF THE COURSE

As previously mentioned, the Session took place at the Cavendish Laboratory, 
which provided the necessary space and the services of very competent labora- 
tory assistants. The procurement of laboratory equipment and films was 
arranged by Educational Services Incorporated, the organization which 
administers the PSSC affairs. Local arrangements were carried out by the 
office of the Institute of Physics and the Physical Society of London. The 
Physical Science Study Committee sent four of its veteran members to staff the 
institute: Professor Francis L. Friedman, Massachusetts Institute of Technology ; 
Professor Philip Morrison, Cornell University; Mr. Judson Cross, St. Mark’s 
School, Dallas, Texas; and Dr. Uri Haber-Schaim, Educational Services 
Incorporated. 

A typical working day was divided roughly as follows: 

First there were about one and a half hours of lectures on course material,
with special emphasis on the ‘hows and whys’. These lectures were divided
between Professor Friedman, Professor Morrison, and Dr. Haber-Schaim.
Sometimes films were shown during the lecture to demonstrate the use of films
in physics teaching. There followed a quarter of an hour of discussion of actual
school experience by Mr. Cross.

After the morning coffee breaks, participants divided into three discussion 
groups in which questions of philosophy and local usage of the homework 
problems and tests were discussed in some detail. 

The afternoon session was devoted to laboratory work, followed two or three 
times a week by a film session. 

It was assumed that all participants had read the chapters to be discussed 
ahead of time, and with the high calibre group on hand there was, of course, 
no need to present the content of the textbook in detail. The lecture time was 
therefore effectively used to explain the structure of the material and the reasons 
behind its selection. So, for example, Professor Morrison explained the reasons 
for having a section on scaling early in the course (Chapter 4). Professor 
Friedman took time to elaborate on the treatment of the conservation of 
momentum and Newton’s third law of motion (Chapter 23) and the reasoning 
behind the rather different treatment of electricity. 

The discussion groups were divided as follows: two groups consisted 
primarily of delegates from countries which planned to try the course in pilot 
schools (Spain and Yugoslavia in Group 1, the Scandinavian countries in 
Group 2). The third group comprised representatives of Central European 
countries. Staff members served as moderators in these discussion groups and 
rotated during working sessions to gain maximum personal contact with all 
participants. The activities pursued in these study groups varied from one
group to another and were determined to a large extent by the interests of the
participants. The members of Group 1 were charged by their governments
to prepare themselves to teach the PSSC course in the spring of 1962, and
therefore went into considerable detail in the teaching problems and the ‘Home,
Desk, and Lab’ (these are the homework problems of the text). The members
of Group 2, who plan to have pilot schools in the autumn of 1962, took time
out for thorough discussion of possible additions and changes to suit their 
students. The third group, which was the most heterogeneous, devoted part 
of their time to a frank exchange of opinions on the teaching of physics and the 
philosophy of science in addition to discussing suggested homework problems 
and tests. 

The afternoon laboratory sessions were probably the most unorthodox part
of the whole Working Session. The PSSC laboratory kits were prepared by the
Cavendish Laboratory assistants under the supervision of Mr. Cross. The
participants then proceeded to actually perform and analyse the experiments
as suggested in the laboratory guide. This means that they began by studying
the equipment, taking the measurements, and analysing the results, just as a
student would do. As many of the PSSC experiments are assigned to students
before they have covered the corresponding topics in the text, the participants
had, in some cases, to ‘forget’ their previous knowledge of the expected results.
In other cases this was not necessary, because although the theoretical answer
was known, the accuracy of the particular piece of apparatus was not. All the
staff members were present in the laboratory during these hours and were
available for consultation. On many occasions this detailed study of the
laboratory material resulted in useful comments for improvements in the
laboratory guide itself.




It was felt advisable to use this rather time-consuming procedure in the 
laboratory instead of a quick demonstration of the equipment, because the topics 
covered by the experiment, their function in the course, and the nature of the 
equipment were all rather new to European teachers. In the traditional physics 
course, both in Europe and in the United States, laboratory exercises are used to 
measure certain constants, such as density of a liquid, thermal expansion 
coefficient of a piece of wire, specific heat of a metal, and the like. A large 
fraction of the PSSC experiments, on the other hand, serve to ‘discover’
fundamental laws of physics such as Snell’s law of refraction, Newton’s law of 
motion, Coulomb’s law, and others. By properly timing the laboratory work, 
the experiments then serve also to introduce new ideas and concepts. 

The PSSC laboratory equipment is made of simple, sturdy components, 
often for as little as 10 per cent of the cost of the traditional equipment. Many 
participants admitted that they would never have believed they could get 
meaningful results with the equipment had they not tried it for themselves. 

Late afternoon sessions were particularly devoted to screening the PSSC
films. Of about thirty films screened during the Session, only about ten were
shown at the morning lectures. The screenings were followed by discussions
of the use of films.

The last four days of the Session were devoted particularly to further work 
in committees and discussion groups for the purpose of preparing reports and 
formulating further plans. Some time was also devoted to preview lectures on 
additional material now being developed at ESI. Summaries of the reports 
were submitted in writing and distributed among the participants, and extensive 
oral reports were presented in plenary session by representatives of the 
committees. 


3. CONCLUSIONS


It is natural that most of the discussion time was devoted to criticisms and 
points on which there were disagreements; nevertheless, reading the reports 
of the discussion groups and their sub-committees one finds that there was a 
wide range of agreement as to the merits of the PSSC course. It was generally
granted that no matter how many hours of physics are available in the schools 
a selection of material has to be made and that the selection of the PSSC is a 
good one, though by no means the only good one. It was also the general 
consensus that the text succeeds in treating classical as well as modern physics 
from a unified point of view and that it provides interesting and stimulating 
reading. The laboratory is used more effectively than in the traditional
European school. It was generally agreed that the large-scale co-operation 
between university professors and high-school teachers was a welcome precedent 
for other countries. 

Reservations about the value of the course, or of the approach it advocates,
under local conditions had their origin in differences in the students' previous
knowledge of physics, chemistry and mathematics, and in the distribution
of teaching hours. Another important factor was the deep differences in
social climate, as expressed for example in the student-teacher relationship.

Some participants felt that students in some of their classes could use a more
advanced mathematical approach in dealing with the physics, but the comment 
was also made that too often the teaching of physics is reduced to drill in algebra. 




Table 1. Hours of Physics Teaching per week in Schools of OEEC Countries. 


Age_~ 
12" 132-14 AS 16 ee oe 


inc. chem. 
Austria Gymnasium ye See ae YS 
Realgymnasium QED 3 | 6 el ee. ee ame 
Realschule 2.238 53 G2 ee 
Belgium Wetenschappelyke Afdeling 
Latijns-Wiskundige 1 Sol, pete) ae 
Latijn-Wetenschap Pelijke [Oe A oc ae, 
Grieks-Latynse d Wie wae! Iya LL 
France Lycée Classique 2a 
Lycée Scientifique 2 3. 4 — 
Lycée ‘Technique Dae 2 3 fae 
Greece Classical 2-4 2-4 
Scientific 3 46 46 
Ireland Technical School 4 4 4 4 — 
Secondary School 2) 2A 8t WS as 
mech- 14 14 — 
anics 
Italy Liceo Classico Da eB 
Liceo Scientifico 25s 3s 
Netherlands Gymnasium 
(science line) 2 32 PS ees 
(languages line) 2 2 5 ee 
Secondary school 
(science line) 2 3 4° 4 
(languages line) pS pee, pe 
Norway Science Line 202? ae  Oneay 
English and Latin Line D Ds ee es Pa 
Portugal Liceu 
(chem. and phys.) Be 3 5 oa 
(science students) 
Liceu 
(chem. and phys.) Ue a le 


(arts students) 


eee 


Spain Institute Nacional de Ensenaza 
Media 3 — 4 3 
Escuela Tec. Media Sie ES 
Escuela Maestria Industrial yy) 3 


Collige Gymnase 


Switzerland Classique PRM ale pin ea 
(vaud) Scientifique 2) Sas ee 
Technique 2) 2 
SS a 
United Kingdom Grammar School 230 2) Sao tee SmeeeS 
asa Se ee aii Bi 
Yugoslavia Science Line 2 lee pmeshion 3) 
Social Science Line Di i as 




In some countries physics is taught in two cycles, and it was felt that the course 
in its present form would not be rigorous enough for the upper cycle but that 
certain parts would be too hard for the lower cycle. In this connection one has 
to remark that in the United States, too, the PSSC is only one link in a chain of 
plans which are under way to extend the revision of physics and chemistry 
teaching to lower grades. In due time, therefore, the other links in this chain 
will exert an influence and a revised sequence will emerge. In this connection, 
a popular misconception was clarified: many people believed that in the United 
States a one-year physics course with about five hours per week, 180 hours 
per year, is all the physics a student can get. Although a physical science course 
in the ninth grade (age group 14-15) in many places leaves much to be desired, 
students have the opportunity to study elements of physical science (Archimedes’ 
principle, levers, etc.) before they embark on the PSSC course. Many schools 
have general science programmes in their seventh through ninth grades—that 
is, before they embark on specialized programmes in physics, chemistry, and 
biology. Furthermore, although the physics courses in the various classes in 
secondary schools are traditionally taught in one year, the number of hours 
varies from school to school, being usually no less than five and up to seven 
and eight, and sometimes even ten. This makes it rather convenient to schedule 
double periods for laboratory work. During the Working Session, participants 
were asked to compile a table indicating the number of hours per week and the 
number of years spent on physics. They responded to this request, working
from memory. Though there may be some inaccuracies I believe that the general 
picture that one can get from these tables justifies their inclusion here. 

If we compare the numbers in these tables with the ten hours or more avail- 
able to the American student between the ages of fourteen and eighteen, we see 
that some European schools do not have enough hours to teach the PSSC course, 
and that although they spread their physics over a larger number of years, the 
total number of hours is in many cases only comparable to that available in the 
United States. This does not imply, of course, that where the number of hours 
is about the same, one can just go and spread the course over a longer period. 
Nevertheless, some participants thought it worth trying. 

The PSSC text uses the MKS system, which in the United States is slowly
beginning to replace the English units and in Europe the CGS system. But the
PSSC also takes notice of the fact that nature provides an elementary electric
charge and it uses this as a unit of charge in addition to the macroscopic
coulomb. This raised objections on the grounds that the IUPAP Commission
on Units has not incorporated the elementary charge as one of the fundamental
units but uses the ampere as a fundamental unit of current. It was pointed out
in the discussions that the IUPAP Commission based its selection of units on the
present status of technology, which determines the units that can be measured
most accurately, whereas the charge on the electron seems to have a much more
natural and fundamental significance.

The largest diversity of opinion was reserved for films. This is a characteristic
of films rather than of the teachers; it has occurred on other occasions both
in the United States and other countries. One can hardly find a film which 
would not be highly praised by one person and violently objected to by another. 
The evaluation of films was particularly difficult owing to language barriers. 
Though most of the participants read English freely, some found it hard to
follow the films with the English sound track with American pronunciation.
An interesting example is the fact that ‘Crystals’, which is among the most
highly rated films in the United States and was very well received by the
participants from the United Kingdom, was dismissed as ‘too much talk’ by
almost everybody else. It was generally felt that European teachers would make 
less use of the films than their American colleagues, partly because of the avail- 
ability in schools of first-rate teacher demonstration equipment and partly for 
another reason to be discussed later. 

The criticisms and the reservations mentioned so far are of what one might 
call a strictly professional nature. To the writer, the most revealing criticisms 
were those that had their origin in the differences in the social and cultural 
climates of the various European countries. The format of the textbook is an 
example. It was argued that this textbook should certainly be available to the 
teacher, but that the student needs only a short compendium giving the main 
outlines; the source of information should be the teacher. Similar comments, 
even more powerfully stated, were made about the films. Speakers generally 
recognized the value of the films as a tool to show experiments which cannot be 
demonstrated on the bench. But it was argued that the explanation and elabora- 
tion of the experiment should be left to the teacher; there is no need to have 
another person intrude in the classroom. To the writer it seems unavoidable 
that with little to read, with the laboratory in a subordinate role in the classroom, 
with the masterful and impressive demonstrations, and with very scanty films, 
a relationship evolves which puts the teacher in the position of being the sole 
source of knowledge for the student. The class period is then used for lecture 
by the teacher and note-taking by the student. 

It is clear that in a cultural climate where this is considered the right way of 
conducting classes, the PSSC course cannot fit in its spirit but only in its content. 
From the beginning, the PSSC course was developed with the opposite 
philosophy in mind; instead of the teacher telling the student what the scientists 
have found, he is expected to help the student to find out for himself by experi- 
mentation and calculation how nature behaves. In this system there is quite a 
lot of talking and discussion between student and teacher and among students 
themselves. With this approach it is desirable to invite guest teachers (at least 
on the screen) into the classroom, in particular active research physicists, so that 
the term science will not carry a strange and unworldly connotation. 

Another aspect of the PSSC approach was mentioned in this connection. 
With the rapid change of the world scene and with the tremendous rate at which 
new knowledge is acquired, the young student must be taught how to learn by 
himself once his formal education has terminated. For the professional scientist 
this means to learn more science, for the engineer more technology, but for both 
of them more general understanding of the world they live in. Once out of 
school the young adult must know how to gather facts, analyse them and draw 
conclusions on his own. The Physical Science Study Committee feels that its 
course in physics is a modest attempt toward preparing the coming generation 
for these tasks while they are still in school. 


Ozone in the Earth’s Atmosphere 


by R. FRITH 
The Meteorological Office, Bracknell, Berks. 


SUMMARY 


A brief account is given of the processes which lead to the formation of a 
‘layer’ of ozone in the earth’s atmosphere at a height of about 30 km; 
together with an account of several instrumental techniques which may be 
used for the investigation of this layer. 


1. INTRODUCTION 


The ‘ oxygen’ we breathe is a gas every molecule of which is compounded 
of two atoms of oxygen. We cannot live without it; so it is strange that ozone— 
a gas which differs from ‘ oxygen’ only in that each molecule is compounded 
of three atoms of oxygen instead of two—is extremely poisonous: a concentration 
of one part of ozone in a million parts of air is usually considered to be lethal. 

Concentrations approaching this are occasionally found in association with 
‘smog’ (the city of Los Angeles has a particularly bad reputation in this respect) 
but usually the concentration at ground level is extremely low—and, happily, 
no higher near the sea-side than anywhere else! The concentration remains low 
throughout the troposphere (in temperate latitudes usually 10-15 km deep) in 
which the temperature decreases quite rapidly with height and which contains 
everything which we usually refer to as ‘ weather ’—fog, clouds and precipitation. 
In the stratosphere the temperature generally increases with height; it is a 
cloudless region with relatively little turbulence and here the ozone concentration 
increases rapidly to a maximum, of up to 10 parts per million parts of air at a 
height of about 30 km; falling off equally rapidly above this level. 

The air at 30 km is, of course, very thin—the density is about 1/100th of 
the density at the surface—so that although the ozone/air mixing ratio is high 
the amount of ozone present is small; in all ozone accounts for no more than 
about one part in ten million of the earth’s atmosphere. Nevertheless this 
ozone, small in amount though it is, effectively cuts off all solar ultra-violet 
radiation of wave length shorter than about 2900 Å. Ultra-violet radiation is 
damaging to most forms of life so life could not have evolved in its present form 
were it not for this layer of ozone high in the atmosphere. We must, I suppose, 
put this to the credit side for ozone. On the other hand aircraft are beginning 
to fly at the heights of high ozone concentration and since, at least with com- 
mercial passenger aircraft, air for breathing is normally supplied from outside, 
something must be done about the ozone. Unless at least 99 per cent of it can 
be destroyed in the quite short time spent in the pumping phase, passengers 
would suffer. Fortunately ozone is fairly easy to destroy. 


2. HISTORICAL 


The abrupt cut-off in radiation from the sun at about 2900 Å had long been 
known, but it was not until 1880 that it was surmised that this cut-off was due to 
absorption by ozone in the earth’s atmosphere. In that year W. N. Hartley 
first measured the absorption spectrum of ozone in the ultra-violet, discovered 
the absorption bands which now bear his name (fig. 1) and deduced that the 
solar cut-off must be due to ozone. It was at first assumed that this ozone must 
be in the lowest layers of the atmosphere. However measurements of the solar 
spectrum taken at the tops of mountains showed no perceptible change from the 
spectrum at ground level. The ozone was clearly not in the lower layers; but 
where it was remained unknown until the 1920’s. Between 1920 and 1930 not 
only were instrumental techniques developed which enabled both the total 
amount of ozone and the height of the ‘ ozone layer ’ (more strictly the height 
of the level of maximum concentration) to be determined but it also began to be 
understood why the ozone layer was there at all and a vertical distribution profile 
was calculated on theoretical grounds. 


[Graph: horizontal axis, wave length (Å), from 2000 to 3400.]

Fig. 1. Absorption by ozone in the Hartley band. The diagram shews the amount of 
radiation which would pass through 0.1 cm of ozone at N.T.P. (this is about what 
there is in a vertical column above a point on the earth’s surface). 


3. THE DOBSON OZONE SPECTROPHOTOMETER 


The instrumental techniques were basically those which are now in universal 
use in the Dobson Ozone Spectrophotometer. Consider what happens to a 
beam of sunlight during its passage through the earth’s atmosphere. Some of 
the radiation is lost by scattering: some is absorbed by various gases; and some 
may be absorbed by dust. If the intensity of the incoming radiation were known, 
and if the scatter by dust and by air, and the absorption, if any, by dust could be 
estimated, then, from a measure of the intensity reaching the ground, it would 
be possible to compute the amount lost by absorption by gases. If the absorption 
laws of the relevant gases were also known, and if these laws are independent of 
the temperature and pressure of the gas, then the amount of any gas could be 
determined. The absorption laws for ozone are known and the absorption in 
the Hartley band does not depend significantly on temperature or pressure; so, 
theoretically, the amount of ozone in the earth’s atmosphere could be determined 
by measuring the intensity of direct solar radiation at the ground, somewhere 
towards the long wave end of the Hartley region. There are, however, two 
difficulties. First the intensity of solar radiation ‘ outside the atmosphere’ in 
the Hartley band is not accurately known, even in these days of rockets and satel- 
lites. Second, although the amount of radiation scattered by air can be estimated 
fairly well it is impossible to make a good estimate of the amount scattered by 
dust. These difficulties were overcome by making measurements not of the 
intensity at a single wave length but of the ratio of the intensity at two wave 
lengths at one of which ozone absorbs strongly and at the other weakly. 
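
The two-wave-length idea can be illustrated with a short numerical sketch. It assumes the usual exponential absorption law, takes the loss by dust to be the same at both wave lengths (so that it cancels in the ratio), neglects scattering by air, and treats the ratio outside the atmosphere as known. The function names, absorption coefficients and path factor are hypothetical values chosen for illustration, not instrument data.

```python
import math


def transmitted(i_top, alpha, ozone_cm, path_factor, dust_transmission):
    """Intensity reaching the ground at one wave length.

    Assumes exponential absorption by ozone and a dust loss expressed as a
    plain transmission factor (hypothetical simplification).
    """
    return i_top * math.exp(-alpha * ozone_cm * path_factor) * dust_transmission


def ozone_from_ratio(ratio_ground, ratio_top, alpha_strong, alpha_weak, path_factor):
    """Ozone amount (cm at N.T.P.) from the ratio I(strong)/I(weak).

    ratio_top is the same ratio outside the atmosphere, assumed known here.
    Any loss that is equal at the two wave lengths cancels in the ratio.
    """
    return math.log(ratio_top / ratio_ground) / ((alpha_strong - alpha_weak) * path_factor)


if __name__ == "__main__":
    # Illustrative numbers only: coefficients per cm of ozone, path factor 2.
    strong = transmitted(1.0, 3.0, 0.3, 2.0, dust_transmission=0.6)
    weak = transmitted(2.0, 0.5, 0.3, 2.0, dust_transmission=0.6)
    print(ozone_from_ratio(strong / weak, 1.0 / 2.0, 3.0, 0.5, 2.0))
```

Because the dust factor multiplies both intensities equally, changing it leaves the recovered ozone amount unchanged in this sketch, which is the point of measuring a ratio rather than a single intensity.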

Dobson ozone spectrophotometers are now in routine operation all over the world 
and much has become known about the global distribution of total ozone, and 
about its day to day changes. There is, for example, a latitudinal variation, 
ozone amounts being least in equatorial regions and greatest near the poles; 
superposed on this latitudinal variation are day to day fluctuations closely linked 
with surface pressure systems. 

It was difficult to determine the height at which the ozone occurred until, as 
Götz wrote in 1934, ‘The improved means now available for measuring the 
amount of ozone in the atmosphere, using photoelectric cells (and here was 
given a reference to a paper describing photoelectric cells), have made possible 
an entirely different method of estimating the height.’ The original ozone 
spectrophotometers measured the radiation intensity photographically and could 
only be used with direct sunlight. With the new photoelectric cell measure- 
ments could be made using the very much weaker scattered light. In fig. 2 we 
suppose one of these new instruments is situated at the point P, looking vertically 
upwards, so that it normally receives no direct solar radiation. The radiation it 
receives is that which has been scattered (by air and dust) and we may divide 
this radiation into two parts: that which comes from the atmosphere above the 
ozone region (considered, for the moment, to consist of a single shallow layer), 
and that which comes from below. Denote these two regions by A (above) and 


[Diagram in two panels: (a) high sun; (b) low sun.]


Fig. 2. Illustrating the Umkehr technique for the determination of the vertical distribu- 
tion of ozone in the earth’s atmosphere. 


B (below). We denote by $\lambda_1$ and $\lambda_2$ the two wave lengths at which measurements 
are made (absorption being strong at wave length $\lambda_1$ and relatively weak at $\lambda_2$); 
then the ratio $I(\lambda_1)/I(\lambda_2)$ for the radiation received at the ground may be written 
$$\frac{I(\lambda_1)}{I(\lambda_2)} = \frac{A(\lambda_1) + B(\lambda_1)}{A(\lambda_2) + B(\lambda_2)}.$$ 
We note first of all that the two quantities $A(\lambda_1)$ and $A(\lambda_2)$ will be almost 
independent of solar elevation, since, even with the sun almost horizontal as in fig. 
2(b), there is so little air, and no ozone, in the path SA, that solar radiation 
arriving at A is not significantly attenuated. However, although constant, the 


[Diagram, with the ozone layer marked.]


Fig. 3. Experiment to measure the vertical distribution of ozone in the earth’s atmos- 
phere from a satellite. 


light scattered will be small since the air density at A is low. So, to begin with, 
with the sun fairly high, as in fig. 2(a), the ratio $I(\lambda_1)/I(\lambda_2)$ is dominated by 
the ratio $B(\lambda_1)/B(\lambda_2)$. As the sun sinks, the path SB passes more and more 
tangentially through the ozone layer and therefore both $B(\lambda_1)$ and $B(\lambda_2)$ will 
become smaller, and since the absorption is much greater at wave length $\lambda_1$ than 
at wave length $\lambda_2$ the ratio $B(\lambda_1)/B(\lambda_2)$ will decrease. Thus, at first, the ratio 
$I(\lambda_1)/I(\lambda_2)$ will fall. However when the sun is really low the ray SB passes 
through so much ozone that the term $B(\lambda_1)$, if the wave length $\lambda_1$ has been suitably 
chosen, will become so small compared with $A(\lambda_1)$ that the ratio $I(\lambda_1)/I(\lambda_2)$ 
becomes effectively equal to $A(\lambda_1)/(A(\lambda_2) + B(\lambda_2))$. Since $A(\lambda_1)$ and $A(\lambda_2)$ may 
be regarded as constant and $B(\lambda_2)$ is decreasing this ratio now increases as the sun 
continues to sink. Thus if the ratio $I(\lambda_1)/I(\lambda_2)$ is plotted as the sun 
sets there will be a turning point (German ‘Umkehr’), usually when the sun is 
about 5° above the horizon, and from the position of this turning point the height 
of the ozone layer could be determined. It is usual to refer to observations of 
this type as Umkehr observations. Ozone is not, of course, really contained in 
a single thin layer, and the interpretation of Umkehr observations is more 
complicated than implied in the discussion above; however, from a complete 
analysis, it is possible to determine, not only the height of the ozone maximum 
but also the amounts of ozone, in 5 km steps, up to about 50 km. 
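
The fall and subsequent rise of the ratio can be reproduced with a toy calculation. The sketch below keeps the single-thin-layer picture used above: the light scattered above the layer (A) is constant, while the light scattered below it (B) is weakened exponentially as the slant path through the ozone grows. All numerical values (the scattered fractions, the vertical ozone optical depths at the two wave lengths, and the range of slant-path factors) are hypothetical, and no attempt is made to relate the path factor to a true solar elevation.

```python
import math


def umkehr_ratio(path_factor, a1=0.02, a2=0.02, b1=1.0, b2=1.0,
                 depth_strong=1.0, depth_weak=0.2):
    """Toy ratio I(lambda_1)/I(lambda_2) for a zenith-pointing instrument.

    a1, a2: light scattered from above the ozone layer (taken as constant).
    b1, b2: light scattered from below the layer before ozone absorption.
    depth_strong, depth_weak: vertical ozone optical depths (hypothetical).
    path_factor: slant path through the layer relative to the vertical path.
    """
    below_strong = b1 * math.exp(-depth_strong * path_factor)
    below_weak = b2 * math.exp(-depth_weak * path_factor)
    return (a1 + below_strong) / (a2 + below_weak)


def turning_point(path_factors):
    """Return (path_factor, ratio) at the minimum of the toy ratio."""
    ratios = [umkehr_ratio(m) for m in path_factors]
    k = ratios.index(min(ratios))
    return path_factors[k], ratios[k]


if __name__ == "__main__":
    factors = [1.0 + 0.5 * i for i in range(59)]  # slant factors 1 to 30
    print(turning_point(factors))
```

With these illustrative numbers the ratio first falls as the sun sinks, because the B terms dominate, and then rises once the strongly absorbed B term has become negligible beside the constant A term.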

This technique was proposed in 1929 by Götz and observations and 
measurements were first made of the vertical distribution of ozone in 1932 (Götz, 
Meetham and Dobson 1934). 


4. PHOTOCHEMICAL THEORY 


It was just about this time that Chapman and others were asking themselves 
why the ozone was there, and why it was at that particular height. It had been 
realized that the ozone must be formed by the action of ultra-violet radiation 
on oxygen. Ultra-violet radiation of short enough wave length will split a 
molecule of oxygen (O₂) into two atoms of oxygen (O). These free atoms of 
oxygen will then combine with molecules of oxygen (O₂) to form molecules of 
ozone (O₃). However ozone is itself split by ultra-violet radiation, each molecule 
of ozone providing one molecule of oxygen (O₂) and one atom of oxygen. Thus 
there will tend to be an equilibrium state in which the rate of destruction of ozone 
by the ultra-violet radiation is exactly equal to the rate of formation of ozone by 
the chemical reaction O + O₂ → O₃. This chemical reaction, in turn, depends upon 
the rate of destruction of oxygen (O₂). Thus the amount of ozone in the 
equilibrium state will depend not only upon the amount of oxygen present but also 
upon the relative intensities of the radiations which destroy ozone and those 
which destroy O₂. At very high levels, above 80 km, the radiation which 
destroys O₂ is so powerful that there is virtually no O₂ present, and therefore 
there can be no ozone. At lower levels this radiation becomes less intense, 
because of absorption losses; some, but not all, the O₂ is destroyed; and ozone 
should appear. At lower levels still there is practically none of this radiation 
left, no O₂ is destroyed; and there should be, once again, no ozone. 
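
The reaction scheme described above can be written out compactly. This is only a restatement of the steps named in the text; the rate symbols $J_2$, $J_3$ and $k$ are labels introduced here for illustration and do not appear in the original.

```latex
\begin{align*}
\mathrm{O_2} + h\nu &\rightarrow \mathrm{O} + \mathrm{O}
  && \text{(rate } J_2\,[\mathrm{O_2}]\text{)} \\
\mathrm{O} + \mathrm{O_2} &\rightarrow \mathrm{O_3}
  && \text{(rate } k\,[\mathrm{O}]\,[\mathrm{O_2}]\text{)} \\
\mathrm{O_3} + h\nu &\rightarrow \mathrm{O_2} + \mathrm{O}
  && \text{(rate } J_3\,[\mathrm{O_3}]\text{)}
\end{align*}
% Equilibrium condition as stated in the text: destruction of ozone by
% ultra-violet radiation balances its formation from O and O_2.
\[
  J_3\,[\mathrm{O_3}] = k\,[\mathrm{O}]\,[\mathrm{O_2}]
\]
```

In this notation the two limiting cases of the text read as follows: where there is virtually no O₂ left, or where $J_2$ is negligible so that no free atoms of oxygen are produced, the right-hand side vanishes and no ozone is formed.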

In detail this theory suggests that there should be no ozone up to a height 
of about 10-15 km, then a rapid increase to a maximum at about 30 km, followed 
by a decrease to zero at about 80 km. This is broadly what measurements 
reveal; but there are two striking discrepancies: the ozone concentration is much 
too high at low levels; and much too low at the level of maximum concentration. 
It is believed that these discrepancies are linked, both being caused by a redistri- 
bution of ozone by air movements and turbulence. How the ozone may be 
redistributed by air movements and by turbulence is considered in the following 
sections. 


5. REDISTRIBUTION OF OZONE BY AIR MOVEMENT AND TURBULENCE. 


At heights above about 40 km the ultra-violet radiations are so strong that 
equilibrium states for ozone formation are quickly reached. At these levels, 
therefore, there can be no redistribution by air movement for, as quickly as the 
air moves, the ozone concentration adjusts itself to the new conditions. But 
at lower levels, where radiation intensities become weak, the time constants can 
be very long—as great as one year at 25 km. At these levels it is unlikely that 
equilibrium is ever reached, and ozone may readily be redistributed. This 
redistribution may be by large-scale organized air currents (horizontal or vertical) 
or by turbulence. The mechanics of redistribution by air currents are obvious: 
for example if there is a vertical air current through the level of the ozone 
maximum then the level of maximum concentration will change. Moreover since 
vertical movement necessarily implies expansion or contraction of air the total 
amount of ozone in a vertical column may also change. It is presumably this 
sort of thing which causes those day to day variations which are found to be 
closely linked with the synoptic situation. The interpretation of measurements 
of total ozone, and of its vertical distribution, would be much easier than it is if 
this were the only method of redistribution. Unfortunately turbulence also 
plays a part. 

Turbulence tends to move matter always from places of high concentration 
to places of lower concentration. In the atmosphere turbulence processes must 
always be moving ozone from the level of maximum concentration, at about 30 km, 
to levels of low concentration lower down and in so doing the concentration will 
be reduced at 30 km and increased at the lower levels. This is what is found. 
Very little is yet known about turbulence at these great heights and we cannot 
tell whether turbulence is, in fact, a major factor in the redistribution process 
or not. Many more measurements of ozone concentration are needed, and these 
will, it is hoped, be provided by the satellite experiment which is now being 
prepared by the British Meteorological Office. 


6. A SATELLITE OZONE EXPERIMENT 


The ozone experiment which the Meteorological Office is building will, all 
being well, be put into orbit by the National Aeronautics and Space Administra- 
tion of the U.S.A., using a Scout launching vehicle, in 1963. In this experiment 
measurements of ozone concentration are made only at sunrise and sunset— 
when the satellite ‘ sees’ the sun through a ‘ corner’ of the earth’s atmosphere 
(fig. 3). As the satellite moves from A to C the amount of ozone between the 
satellite and the sun steadily increases and therefore the intensity in an ozone 
absorption band of the solar radiation reaching the satellite, steadily decreases. 
By measuring this radiation continuously during the minute or so the satellite 
takes to pass into, or out of, the earth’s shadow, and applying a sort of 
differentiation process, the vertical distribution of ozone can be computed. It will 
not be possible to measure ozone amounts below about 30 km since, for rays which 
skirt the earth at distances less than this, the ozone absorption, and the scatter by 
air and dust, will be so great that practically no light will get through to the 
satellite. But from 30 km upwards it will, it is hoped, be possible to compute the 
ozone amounts over 5 km layers with useful accuracy. Two measurements will 
be made each orbit and the 30 or so measurements in any 24 hour period will 
girdle the earth along two lines of latitude, one north of the equator, one south. 
The precession of the satellite orbital plane, caused by the equatorial bulge, 
will result in a gradual shift of these two girdles so that the earth will be com- 
pletely covered (apart from the polar regions, which will never be seen) in 
about one month. 
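
The article does not spell out the ‘sort of differentiation process’. One standard way of carrying out an inversion of this kind, usually called onion peeling, is sketched below purely as an assumption about what such a computation might look like; it is not taken from the experiment itself. The atmosphere is divided into spherical shells of uniform ozone density, each measurement is treated as the total ozone along a ray whose lowest point lies at the bottom of one shell, ozone above the top shell is neglected, and the densities are recovered from the top shell downwards. The function names, shell edges and densities are hypothetical.

```python
import math

EARTH_RADIUS_KM = 6371.0


def path_lengths(tangent_km, shell_edges_km):
    """Length (km) of a ray with the given tangent height inside each shell."""
    rt = EARTH_RADIUS_KM + tangent_km
    lengths = []
    for lo, hi in zip(shell_edges_km[:-1], shell_edges_km[1:]):
        r_hi = EARTH_RADIUS_KM + hi
        if r_hi <= rt:
            lengths.append(0.0)  # shell lies wholly below the ray
            continue
        r_lo = max(EARTH_RADIUS_KM + lo, rt)
        # Factor 2: the ray crosses the shell on both sides of the tangent point.
        lengths.append(2.0 * (math.sqrt(r_hi ** 2 - rt ** 2)
                              - math.sqrt(r_lo ** 2 - rt ** 2)))
    return lengths


def slant_amounts(densities, shell_edges_km):
    """Forward model: ozone along rays tangent at the bottom of each shell."""
    amounts = []
    for i in range(len(densities)):
        lengths = path_lengths(shell_edges_km[i], shell_edges_km)
        amounts.append(sum(l * d for l, d in zip(lengths, densities)))
    return amounts


def peel(amounts, shell_edges_km):
    """Recover shell densities from slant amounts, working from the top down."""
    n = len(amounts)
    densities = [0.0] * n
    for i in reversed(range(n)):
        lengths = path_lengths(shell_edges_km[i], shell_edges_km)
        above = sum(lengths[j] * densities[j] for j in range(i + 1, n))
        densities[i] = (amounts[i] - above) / lengths[i]
    return densities


if __name__ == "__main__":
    edges = [30.0, 35.0, 40.0, 45.0, 50.0, 55.0, 60.0]  # 5 km layers
    true_densities = [5.0, 4.0, 3.0, 2.0, 1.0, 0.5]      # arbitrary units
    print(peel(slant_amounts(true_densities, edges), edges))
```

Each step subtracts the contribution of the shells already solved, which is why the procedure behaves like a differentiation of the measured curve with respect to height.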

It is of interest to note that a similar, but cruder, experiment has been carried 
out by observing the moon at times of lunar eclipse. During a partial eclipse 
the shadow of the earth is cast on the moon. Close to the edge of this shadow 
there must be a border of reduced illumination, this being, in effect, the ‘shadow’ 
of the earth’s atmosphere. If light from this area of reduced illumination is 
examined spectroscopically, and compared with light from a point well away 
from the earth’s shadow, the reduction in intensity due to ozone absorption can 
be determined and an estimate can be made of the height and intensity of the 
ozone layer. 


Balloon Observations 

The satellite technique will make measurements only down to 30 km or so. 
Measurements below 30 km can be made using balloons, and instruments 
operating on two quite different principles are now in use. In the optical sonde 
measurements are made of solar radiation in ozone absorption bands and, as the 
sonde rises, the ever decreasing amount of ozone between the sonde and the 
sun, and thence the ozone concentration at each level, is computed. These 
measurements can, of course, be made only during the day time. Other sondes 
use adaptations of established chemical methods for measuring ozone concentra- 
tions, telemetering the measurements back to ground. Chemical techniques 
have the advantage over optical methods in that much greater detail is obtained, 
and they can be used by day or by night. On the other hand they do not give any 
information about the amount of ozone still above the balloon at the limit of the 
ascent. Measurements using both types of sonde have been made rather 
spasmodically in a number of countries but it is hoped that regular synoptic 
networks of ozone sondes will be established before long so that it will be possible 
to combine the results from this network with satellite measurements and so 
obtain information continuously, on a global scale, about the ozone distribution 
from the surface upwards. 


REFERENCE 
Götz, F. W. P., Meetham, A. R., and Dobson, G. M. B., 1934, Proc. Roy. Soc., A 145, 416. 


The Author: 

Joined the Meteorological Office, from Trinity College, Cambridge in) 19869" Up to, 
and throughout, the war he was engaged on forecasting duties, but in 1946 he became 
Head of the Meteorological Research Flight at Farnborough. In 1960 Dr. Frith was 
appointed Head of the newly formed High Atmosphere Research Branch of the 
Meteorological Office. 


The Philosophical Basis of Physics 


by GERD BUCHDAHL 
Cambridge University 


Our first reaction to the very suggestion that physics is in need of a basis is 
probably much like that of Sir Oliver Lodge when he wrote (in Man and the 
Universe, 1908, p. 50): 

“To say that a system does not rest upon one special fact is not to impugn its 
stability. The body of scientific truth rests on no solitary material fact or group of 
facts, but on a basis of harmony and consistency between facts: its support and ultimate 
sanction is of no material character.” 

So we might insist that physics is not in need of a basis, and that the paradigm 
of something ‘ supported ’, either from above or below, is the wrong one. But 
perhaps this is a misunderstanding; we cannot tell until we have seen this picture 
in its historical setting. Perhaps that also will teach us something about the 
possible justification of the picture, though we shall see that there is an important 
element of insight in Lodge’s remark. 


1. DEDUCTIVE AND NON-DEDUCTIVE APPROACHES 


One source stimulating the search for foundations has always been the 
pre-occupation with axiomatic systems. Plato already used geometry as his 
primary model: scientific knowledge there comes to be defined in terms of 
theorems derived from ‘ hypotheses ’; and to avoid an infinite regress he imports 
the notion of a ‘direct apprehension’ of the primary hypotheses. Even in 
those early beginnings, it is never clear whether ‘ apprehension’ is a subject 
for procedural instructions, or whether the reference is rather one to ‘ pre- 
suppositions’. Thus, when Aristotle alludes to the need for the ‘ intuition ’ of 
the basic premisses, we cannot be sure whether he is drawing attention to the 
need for a special (though mysterious) mental act which is itself the result of a 
definite methodological enterprise, or whether he is only saying that possession 
of such premisses carrying universality and necessity is an ultimate presuppo- 
sition of scientific knowledge. 

So far the logical aspect. But our mathematical model also had a more 
direct and specific influence: under its prompting mathematics as such becomes 
the ideal of knowledge. As an immediate consequence this raised the problem 
of the relation of physics to mathematics. There are several traditional responses 
to this question, among them the following: 

(a) The Plato-Locke tradition: there exists a ‘ cut’ between mathematics and 
physics; physics is only a ‘ likely story’, mere ‘ opinion’, not yielding the 
‘certainty ’ of mathematical knowledge; itself ‘ certain’ of course only 
because abstracting from all physical attributes not in agreement with it. 

(b) The Cartesian tradition: the identification of physical matter and geo- 
metrical form; the ‘ mathematization of physics’. 

(c) The Galileo-Newton tradition: there is always some discoverable 
mathematical formulation of physical facts, without the latter losing their own 
physical status; ‘physics in mathematical harness’. 


The history of science reveals all three approaches. The works of the great 
17th century methodologists and philosophers display the first brilliant sketches 
defining the various methodological principles and precepts, and illustrating 
the corresponding procedural routes. And because they come to it fresh and 
with an open mind, their writings for ever repay renewed study; the doctrines 
which they develop, and which later generations have turned into hardened 
dogmas, we are here able to observe emerging and growing in their proper 
context. They often therefore yield insights denied to us when concentrating 
solely upon the more normal developments of later ages. Descartes’ scientific 
and metascientific writing offers one of the best illustrations of the rich variety 
of ideas introduced into the field of methodology and logic at that time. 


A. Deductive approaches 


“Apprehension of first principles” and “mathematization of physics” 
become united in Descartes’ attempt to place the fundamental laws of dynamics, 
and in particular the law of inertia (‘ Newton’s first law of motion’) on a purely 
conceptual basis, though this was of course not inconsistent with the belief in 
the possibility of such laws at the same time yielding powerful explanations—as 
indeed they were subsequently found capable of doing. The train of Descartes’ 
reasoning is briefly: given the idea of pure spatialization of matter, and its 
characterization by a single additional attribute, viz. motion; defining this 
motion as a ‘ state of matter ’, where a ‘ state ’ (like other ‘ properties ’) is subject 
to the causal principle; adding the subsidiary definition of ‘ motion’ as rate of 
change of displacement with respect to time; and finally assuming space to have 
Euclidean properties; then the first law is a direct logical consequence of these 
premisses and concepts.(1) 

This is quite in line with modern logical discussions of dynamics which 
frequently consider its primary axioms to be implicit definitions of its key terms, 
e.g. inertia, mass, time, force, energy, etc.(2) The most extreme form of such a 
conceptual approach is in Eddington’s work, which interprets many of the basic 
major principles of physics either as methodological prescriptions or as the out- 
come of the choice of a certain mathematical system. 

The feeling that the law of inertia is a ‘ fundamental ’ law, is probably due to 
the fact that it may be made to appear (as in Descartes’ account) a special 
‘interpretation’ of a version of the causal principle (‘‘ what is real does not 
change’). Such principles form a sort of ‘regulative component’ of the 
corresponding physical law; they frequently function ‘directively’, leading to an 
‘anticipation’ of the law, long before normal abstractive and experimental 
procedures have been able to supply any ‘ constitutive support ’ for it. Galileo, 
who is frequently credited with having discovered this law, certainly never did 
anything of the kind. And Newton simply gave it an experimental and 
observational basis, in terms of the numerous verifiable deductions from his theory. 

A note on concepts may here be in place. It is fairly easy to see that concepts 
such as mass, force, energy, might be implicit ‘functions’ of corresponding 
axioms, but this is less obvious when we generalize to include all scientific 
concepts, particularly those which though being ‘ unobservables’ we are more 
often inclined to consider in rather more ‘ realist’ terms, e.g. such substantival 
entities as atoms, molecules, electrons. Nonetheless, according to some views, 
to say that a certain theoretical entity ‘exists’, means no more than that certain 
explanatory axioms about the properties of an otherwise unspecified ‘something’ 
are true, and furthermore, that these axioms imply certain theorems which allow 
of independent and direct verification. 

On the other hand, there are obvious differences between a ‘ substantival ’ 
type of concept such as a molecule, and ‘adjectival’ concepts like ‘refractive 
index’ or ‘gravitational force’ which act merely as specifiable functions or 
parameters of some corresponding laws which hold between a certain set of 
spatio-temporally determined observables. The former are about certain 
putatively existential entities; because of this (i.e. because of their ‘substantival’ 
nature) they do not explicitly appear in the symbolic formalism of a physical 
theory. Thus, no symbol for ‘molecule’ (only for its mass and velocity, etc.) 
is to be found in a quantitative version of the kinetic theory of gases. Like 
Locke’s substance, being an “unknown-I-know-what”, it drops out of the 
language of physics. Neither the ‘ non-existential’ status of the ‘ adjectival ’ 
group, nor the non-representational aspect of the substantival group, deprives, 
however, either group of what is often called ‘ reality’; such judgements would 
be purely arbitrary. 


B. Inverse-deductive (or ‘ retroductive’) approaches 


Descartes at times refers to the deductive approach as the ‘ synthetic method ’; 
and he contrasts this with what he conceives to be a procedure moving in the 
opposite direction, which he calls the ‘method of analysis’, or analytic method. 
Its advantage, so he claims, lies in the fact that where applicable it actually 
guides us in the discovery of ‘ causes’ for already known physical ‘ effects’. 
(Cf. also Newton’s Opticks, Qu. 31, for the same idea). The deductive method 
is mainly occupied with the formulation of explanatory systems through 
axiomatization or with the laying of conceptual foundations for a science. The 
analytical method emphasizes the primary need for a path leading to the discovery 
of hypotheses in the light of phenomena requiring explanation. Descartes 
(like many modern writers) does not always distinguish too clearly between the 
procedural aspect of the analytical method on the one hand, and its inferential 
component on the other. For the present, we will concentrate on the first of 
these, i.e. the actual construction of hypotheses. 

Now there is no set of mechanical working-rules for the construction of 
hypotheses. Descartes seems to have believed that he had discovered a universal 
‘rule of method ’, partly because what he had really discovered (or rediscovered) 
was the notion of retroductive inference, according to which it makes sense to 
hold that we can ‘ prove’ the hypotheses via the power of their verifiable 
deductive consequences. This, whilst true, is no substitute for a method of 
discovery. Rough hints exist in plenty: curve-fitting, the employment of 
analogy, even the injection of regulative ideas, and countless others. Again, 
by the ‘ analytic method’ we might mean literally such things as the resolution 
of complex motions into their simple components. This was the version 
appearing in Newton’s Opticks, although Newton added to this ‘the derivation 
of the forces producing such motions’: which is something that cannot be 
obtained by the same type of ‘resolution’! 

Still, there lies behind this notion of analysis a fundamental idea: the breaking 
down of a complex of phenomena into its simple underlying operative factors. 
And the fascinating thing is that Descartes actually presents his readers with an 
instance showing how this method may be understood to work in detail; the steps 
in his discovery of the law of refraction, often called ‘Snell’s Law’, are as follows. 
Let us assume the discussion of refraction in terms of ‘ rays of light’; mere 
observation then will yield the (qualitative) information that the ray is broken 
towards the normal to the interface between the media when entering the 
‘ denser’ medium (e.g. air into water). We now feed into this information an 
account of the ‘hypothetical mechanism’ (and Descartes refers to it expressly as 
a mere hypothesis) in terms of which we describe the passage of light. Descartes’ 
account here mentions an all-pervasive ether possessing a particulate structure, 
the passage of light being defined as the passage of a ‘ shock-wave’ travelling 
from the luminous body to the illuminated surface. As the ether is not ‘ com- 
pressible ’, the wave moves with ‘infinite speed’. This is still difficult to 
understand, for the concepts are not sufficiently clear. We shall therefore dig 
still more deeply: to this end we must consider the passage of any natural power, 
be it a wave or a particle. 

So far all is qualitative. Descartes now ‘tightens’ his account by intro- 
ducing a geometrical model to derive the law of refraction quantitatively. 
In other words, he develops a conceptual model to which he then applies those 
general laws of motion which he had already developed elsewhere and which 
he believed to be of very general application. His model does not exactly 
reproduce the (already hypothetical) structure; the reason is that with the 
absolute freedom of genius he considers the matter in terms of the motion of a 
particle, of whose laws he has a knowledge which he lacks concerning the 
hypothetical wave motion. The derivation is not ‘ pure’: there is (as in most 
‘real’ scientific situations) considerable ‘ fitting’ of the account to the desired 
result; we need not reproduce it in any further detail. Yet the general procedure 
is clear: the introducing of the hypothetical mechanism; the search for the 
fundamental operative process, the tightening of this in terms of a quantitative 
model, reasoning round which ultimately leads us back to the desired ‘law’ 
accounting for the qualitative and quantitative relationships. There are here 
the essential ingredients of a genuine ‘ motion’ from ‘ effect’ to ‘ cause’: the 
tentative introduction of a theoretical schema (making it once more possible to 
‘reason deductively’); the rapid shuffling backwards and forwards between 
phenomena and explanation; the side-reference to external principles; the 
employment of models not necessarily isomorphic with the putative hypothetical 
or theoretical structure. 
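
For orientation, the ‘law’ to which this procedure leads may be written in its modern form. The symbols i, r and n are not in the text and are assumed here purely for illustration: i and r are the angles of incidence and refraction, measured from the normal to the interface, and n is a constant characteristic of the pair of media.

```latex
% Law of refraction ('Snell's Law'): the sines of the two angles
% stand in a constant ratio for a given pair of media.
% i, r, n are illustrative symbols, not the author's notation.
\[
  \frac{\sin i}{\sin r} = n
\]
% For passage into the 'denser' medium (e.g. air into water) n > 1,
% hence r < i: the ray is broken towards the normal.
```

The qualitative observation (bending towards the normal) corresponds to the case n > 1; the quantitative content of the law is that n is the same for every pair of observed angles.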

The statement that this was an instance of a ‘ motion’ from effect to cause, 
needs some qualification. There is ‘ motion’ in that direction only in the sense 
that we began with the set of observed pairs of angles of incidence and refraction, 
rising from this to a consideration of the qualitative behaviour of light rays; and 
we ended with the formula—‘ the hypothesis’ in accordance with which the 
values of the angles of incidence and refraction are computable. However, 
there was clearly no step-by-step ascent in mechanical fashion from the 
explicandum to the hypothesis! Less still should the phrase ‘ motion from effect to 
cause ’ mislead us into thinking that here is a method which guarantees a deduction 
of the explanation from the explicandum. (Scientists often speak of a certain 
law, e.g. the law of constant proportions, as having been inferred from certain 
experimental results, a phrase which frequently causes much misunderstanding.) 
But as already mentioned, Descartes’ notion of analysis also covered—in addition 
to the procedural aspect—the idea of an inference (non-deductive!) from effect 
to cause; and this idea is still with us: it goes under various names; we have 
called it ‘ retroductive inference ’, the better-known ‘ inductive inference’ being 
another member of the same species. And we shall now turn to the problems 
raised by the second, the inferential, aspect of analysis; problems which will 
involve more explicitly the contentious notion of a ‘ philosophical basis ’. 


2. A LOGICAL PROBLEM: IS THERE AN INDUCTIVE INFERENCE ? 


The fundamental logical operation involved in retroduction (as well as 
induction) is the affirmation of the antecedent, given the consequent. Of course, 
if we should prove that some given or presumptive construction alone would 
account for a set of data, this would give us formal validity. Likewise, if an 
hypothesis can be shown to imply a consequence contrary to fact, it is thereby 
formally invalidated. But between these logical cornerstones there lies the 
vaguer though more substantial region of positive affirmation. If the hypothesis 
has led to verified data; and if it has not been falsified; and if it fits into a more 
general network of other non-falsified hypotheses; in short if it comes up to the 
standards of procedural performance discussed in section I, above: are we then 
‘entitled’ to ‘infer’ that this hypothesis holds universally ? 
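
The three logical situations just distinguished may be set out schematically. This is only a sketch in a notation which is not the author's: H stands for the hypothesis, E for the observed data, and the arrow for ‘implies’; all three symbols are assumptions introduced for illustration.

```latex
% H = hypothesis, E = observed data (illustrative symbols only).
% (i) Retroduction or induction: affirming the antecedent, given the
%     consequent. Formally invalid.
\[
  (H \rightarrow E),\; E \;\not\vdash\; H
\]
% (ii) The hypothesis alone would account for the data: formally valid.
\[
  (E \rightarrow H),\; E \;\vdash\; H
\]
% (iii) The hypothesis implies a consequence contrary to fact:
%       formally invalidated (modus tollens).
\[
  (H \rightarrow E),\; \neg E \;\vdash\; \neg H
\]
```

Schemata (ii) and (iii) are the ‘logical cornerstones’; the question in the text concerns what, if anything, can be said for (i) when E has been repeatedly verified and H has not been falsified.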

At the outset, we must recognize that in recent times vital objections have 
been raised against the whole conception of non-deductive inference as such: 
objections which if sustained would go a long way towards showing that our 
question was misconceived, even senseless, and with it the whole baneful notion 
of a ‘ basis’. Moreover this suggestion is supported by the consideration that 
no satisfactory answer to our question has ever really been proposed. These 
objections occur in two versions, the first of which is due to Karl Popper, the 
second to Gilbert Ryle, both representing a movement of considerable contem- 
porary thought. 


(1) Since retroductive and inductive inferences are formally invalid, we cannot 
thus define at all the form taken by scientific reasoning. Instead we need 
to replace this by a formally valid inference schema, which involves a falsifica- 
tion procedure, according to which a scientific test can yield an assured result 
only if (and in the sense that) it disconfirms a proposed hypothesis. It is then 
only a ‘ convention’ to hold on to an hypothesis so long as it has not been 
disconfirmed; but as such, non-negative tests have no logical significance for 
a verification-procedure. (®) 


(2) This is a slightly more extreme version of the first, and holds that we should 
disabuse ourselves of the idea that scientists at work ever pass on to the 
business of making an inference of any such kind (excepting deductive 
explications). Rather, they merely show that certain hypotheses work by 
showing them working, i.e. through leading to verifiable consequences. We 
can show our hypothesis working—i.e. observe it as ‘ work-in-progress ’, 
but we do not in addition pass on to any sort of inference: ‘... therefore it 
is true ’.(§) 

I am not sure that either of these objections to non-deductive inference is 
really serious. Let us consider them briefly in turn. The first, and the 
methodological prescription which it entails, seem to possess a certain plausibility 
for many scientists. This is due, I think, only to a lack of realisation of the 
extreme puritanism it involves. It is certainly true that we attempt to test an 
hypothesis by pushing outward and thereby determining the frontiers beyond 
which it may be found to break down. But that is not all this view really entails. 
For the scientist, when he embarks on new tests extending over a novel range, 
with the possibility that his hypothesis might suddenly fail, would certainly 
take it that a non-negative test had implications for the range over which the 
hypothesis is in fact believed to hold! Whereas it is a consequence of the 
doctrine here in question, that a positive test (absence of falsification) contributes 
strictly nothing towards any possible verification, towards any rational belief 
in the strength of the hypothesis; for to do so we should have to assume that such 
tests when repeated under identical conditions would yield similar (or statistically 
predictable) results; i.e. that the situation in question is putatively universal, 
which is just the assumption which the present view was intended to avoid; 
indeed, to make unnecessary. 

Similar comments apply to the second objection. The proposition: ‘ The 
theory works since it can be shown to work’ is not tautologous; its successful 
application on given individual occasions is taken to imply something concerning 
its universal applicability. 

That these doctrines when considered closely lead to situations which appear 
to conflict with the normal logical instincts of scientists, is perhaps no refutation. 
On the other hand, the fact that their plausibility for the scientist depends upon 
the surreptitious re-introduction of the very concept whose contentious nature 
had led to its attempted elimination (the concept of inference to a universal or 
lawlike connection) suggests that we should pursue further the problems 
surrounding it, in order to give ourselves the chance at least of facing clearly 
the logical complications involved. 

The question then is: how do we ‘ know’ that a certain test under identical 
conditions would yield identical results? ‘Know’ is here put in quotation- 
marks in order to remind ourselves that we are no longer concerned with the 
usual every-day use of ‘knowing’. In other words, our enquiry now concerns 
the ‘basis’ of the concept of ‘uniformity’ or ‘law’. It is important to 
emphasize the conceptual nature of this enquiry because this fact is seldom 
realized; we act as though we were concerned with the question: how can we 
know that a certain law is true; or perhaps even “necessary and eternal” 
(Galileo’s words). The answer is: by observation, experiment and theo- 
rizing, and certain additional procedures of conceptualization which we have 
briefly touched on in the first section. And this entails the consequence that 
the conclusions thus arrived at carry a built-in corrigibility-proviso, according 
to which—as Newton put it in his fourth ‘ Rule of Reasoning’ in the Principia 
(Bk. iii)— 

“we are to look upon propositions inferred by general induction from phenomena 
as accurately or very nearly true . . . till such time as other phenomena occur by which 
they may either be made more accurate or liable to exceptions”. 

Clearly our question is not to guess what further data will go to support or 
upset a given law ‘ inferred by general induction from phenomena ’, but rather, 
what is meant by ‘inductive inference’, by the very conception of ‘law’, 
relative to individual phenomena, and what general considerations about the 
world are presupposed in our attempt to make sense of the concept so 
defined? 


3. INDUCTIVE INFERENCE 


(1) Newton’s ‘ Rules’ and the ‘ conventionalist’ doctrine. 


We have tried to separate the two main aspects of scientific thinking, the 
procedural and the inferential; the latter will occupy us for the remainder of this 
essay. Although Descartes had flirted with the problem of retroductive inference, 
on the whole he had been content to fall back on a solution in terms of his 
‘dogmatic’ conception of the ‘ mathematization of physics’. But it was the marriage 
of the Galileo-Newton approach to the Plato-Locke tradition that was to put the 
critical problem into a sharper focus. Newton’s pronouncements on induction 
in the ‘ Rules of Reasoning’ for the Principia, as well as passing references in 
letters to some of his correspondents, mean that in science we must ‘ deduce 
principles from the phenomena’ in order that thereafter they be ‘ made general 
by induction’. This, whilst it helped to formulate the matter, did nothing to 
explain the significance. Perhaps this fact is itself of some importance. Such 
logical formulations would then have to be understood as ‘ procedural gestures ’ 
(cf. Ryle’s ‘ work-in-progress’ doctrine), in accordance with which hypotheses 
formulated in the light of certain phenomena requiring explanation are to be 
understood to possess a generalizing function, a fact expressed by rendering them 
in universal form, or using them with universal import. To say that these are 
‘ procedural gestures ’ would then mean that it does not make sense to demand a 
‘justification’ of these gestures; like social customs, they have social (e.g. 
grammatical) purposes; and these at most require causal explanations. It is as 
though we always demanded a justification for or explanation of any general prin- 
ciple in terms of which we happen to account for some particular state of affairs. 
This, so we then should be told by the ‘ conventionalists ’, would be something 
of a philosophical joke, for general principles are simply introduced as ‘ instru- 
ments ’ for explanations into the general social scientific context; whilst we can 
properly ask for explanations of individual cases, it is socially improper to ask 
the same question concerning explanations of the explanations, at least in 
any ultimate sense. 


These conventionalist positions seem very sophisticated, if only because they 
appear to beg the very question at issue. To consider the analogy last mentioned: 
the argument would succeed here only if explanatory laws could be considered 
exclusively through the use of the philosophical paradigm of ‘ reason-giving 
devices’, thereby treating them as social tools; whereas there are counter- 
instantial paradigms, according to which an explanatory law may be considered 
(for instance) as a novel discovery, capable of falsification; and here (making in 
fact use of the conventionalist mode of argument itself) it would be perfectly 
proper to require a justification. This reflection once again does not constitute 
a logical refutation. Nor is this serious since ‘ logical refutations ’ in philosophi- 
cal disputes proverbially lack sustained force! On the other hand, it suggests 
two things: we should persist in taking the problems raised by the notions of 
‘law’ and ‘ inference ’ seriously; but we should also endeavour at the same time 
to learn the lesson that a ‘ solution’ of the resulting conflicts will have to lean 
heavily on the fact that the philosopher has a position of ‘ relative autonomy ’ in 
terms of the paradigms through which he is going to consider his philosophical 
material. 


(2) The Newton-Locke tradition and the development of Logical Atomism. 


The Newtonian approach of ‘ physics in mathematical harness ’, deprived 
of the Cartesian solution, seemed to question why nature should be—in Galileo’s 
words—“ written in the language of mathematics”; a language whose model- 
relation is that of logical entailment, of necessity†; a language which operates 
with functional relations represented by the general form y=f(x), interpreted as 
a rule ‘ rigidly determining values of y, given values of x’. 

Here John Locke was the first to discuss critically the logical issues involved 
in the Newtonian corpus of knowledge (in his celebrated Essay Concerning Human 
Understanding, 1690). His solution (if it can be called such) is that we have 
‘real knowledge’ only of ‘mathematical truths’, and that such knowledge 
applies to ‘ real things ’ solely because we autonomously select only those features 
of the physical world which agree with the mathematical ‘ archetypes’. (Essay, 
iv. 4, 6). This doctrine, which has had a far-reaching influence down to our 
own days, was epitomized in Kant’s dictum that every “systematic study of 
nature contains only as much science properly speaking as there is in it 
mathematics applied to its data” (Metaphysical Foundations of Science, Preface.®)). 
For Locke, this view had certain scepticist consequences. For it seemed to 
follow that all ‘ knowledge ’ based on the empirical observation of material bodies 
was “but judgement and opinion, not knowledge and certainty” (Essay, iv. 12. 10). 
“Certainty and demonstration are things we must not, in these matters, pretend 
to”, he wrote (Essay, iv. 3. 26); and his terminology suggests (and his argument 
implies) that this is because information gathered through ‘ observation and 
experiment’ does not come up to the ideal of yielding ‘ necessary connexions ’ 
found in the ‘ demonstrations ’ of the mathematicians. 

This again is a ‘ philosophical complaint ’, and not a ‘ scientific’ one. Locke 
was fully aware of the success of Newton’s Principia. His preoccupation is 
with epistemological questions; he struggles to hammer out an adequate con- 
ception of ‘ scientific knowledge ’, a struggle which is brought on by the conflict 
that he seems to find between the paradigm of mathematical knowledge on the 
one hand, and what he calls ‘ experience ’ on the other. 

The origins of this conflict lie in the Lockean conception of ‘ experience ’ 
as being equivalent to the perception and combination of ‘ideas’. These ideas 
he views each as being quasi-self-contained logical ‘atoms’; they “enter the 
mind by the senses simple and unmixed” (Essay, ii. 2. 1). Hence, complex 
ideas (and most of our ideas, including ‘ generalizations ’, are complex) must be 
compounded, the result of a ‘ subjective’ synthesis. 

Though based on small psychological evidence, this story is a myth for the 
following doctrine: 

(a) All knowledge is based on experience. This entails that the character 
of the known is essentially determined by the character of the ‘ process ’ 
of knowing, an important mark of which is that the knower is able to 
consider the known as it is presented ‘ in itself’, apart from all relations 
to other knowable objects (or ideas). This doctrine of the ‘ presentational 
immediacy ’ of the object of knowledge, together with the premisses just 
stated, at once yield the ‘ atomicity’ of all ideas as such. (Atomistic 
ideas of course were a commonplace to the 17th century ‘ corpuscularians’.) 

(b) It is a consequence of (a) that relating our simple ‘ideas’ (of objects) into 
complex ones of substances involves a subjective (‘ mental ’) factor, an 
element of spontaneity on our part. (Locke calls it “voluntariness”.) 

(c) The relations whereby we unite our ideas in experience do not exhibit 
the necessitarian character required by the argument from science, by 
our conception of law; e.g. of causal law. 

† cf. Galileo’s reference to the laws of nature being ‘necessary and eternal’, which we might 
here use, not with respect to the problem of verification of laws, but the meaning of the 
concept. 

The answer to a possible objection to this conclusion will make the last point 
clearer. For it may be said that surely it is ‘in experience’ that we find the 
qualities of objects united for us; the unity can hence be hardly of our own making. 
This is however not Locke’s point. For he would certainly grant as much but 
add that ‘in experience’ qualities are only united ‘ here-and-now’, whereas 
the unity which we mean by an ‘ object’ entails a combination which holds 
universally and with necessity (what Locke calls ‘the real essence ’)—once more 
a conceptual matter (Essay, iv. 12. 9). And this kind of unity is certainly not 
‘ given ’ but is ‘‘ made voluntarily ” (Essay, ii. 12. 2), and hence though certainly 
‘real’ is not ‘ objective’. As he puts it at iv. 3. 14 of the Essay: 


“ Let our complex idea of any species of substances be what it will, we can hardly, 
from the simple ideas contained in it, certainly determine the necessary coexistence of 
any other quality whatever. Our knowledge in all these enquiries reaches very little 
farther than our experience.” 

This, then, is the background to the problem with which philosophers since 
the 17th century have been wrestling, and I think we understand it better by 
tracing it back to its origins. What is the foundation of that relation (called by 
the 17th and 18th century philosophers ‘ necessary connection’, though we 
might want to generalize this) in virtue of which we transcend the given moment 
of ‘ presentational immediacy ’ (to use Whitehead’s term); more generally even: 
what makes it possible—in the sense of logical justification—to speak of ‘ external 
objects ’, of ‘ objective’, lawlike connections ? 


4. THE ‘FOUNDATION OF SCIENTIFIC INFERENCE’: SOME HISTORIC SOLUTIONS. 


It would be possible to approach such questions in various ways. 

(1) Investigate the general philosophical nature of such questions. For instance 
we might pursue the consequences of arguments involving considerations of 
philosophical paradigm-cases (cf. end of section III) and reflect on the nature 
and power of this type of philosophical treatment. For instance: what kind 
of considerations, and what sort of answer do conceptual questions deserve, 
once we have severed them (as we have found necessary) from their material 
basis ?() 
(2) Investigate the nature, meaning and logic of concepts like ‘induction’, 
‘inference’, ‘law’, ‘necessity’, etc. 


(3) Ask what generates the idea of an objective causal connection, and what is 
the general justification of this idea. 

It is only during our own century that a conscious start has been made on 
the exploration of the first two of our questions; whilst the remaining question set 
the tone in terms of which most philosophical systems of the 17th and 18th 
centuries, from Hume through Kant, have debated this matter. Nor should 
we think that their perspectives were necessarily fashioned by intellectual 
obtuseness. In many cases the ‘ solutions’ bear a surprising similarity to one 
another once the temporal forms have been stripped away. Which explains the 
importance for us of a renewed study of some of these earlier powerful and 
penetrating attempts. 

For instance, a great deal of time and ingenuity has been devoted in recent 
times to the investigation of the logical nature of lawlike statements. Should 
we think of these as empirical generalizations or as general schemata, ‘ methods 
of representation ’ functioning perhaps like ‘ inference tickets’ permitting the 
making of predictions ?@) Or again, are lawlike statements a class of ‘descriptive 
statements ’, or are they more like statements of an entailment relation, indicative 
of some kind of necessity?“ How to explain the peculiar feature of ‘ unfulfilled 
hypotheticals’ ?@*) When we say that the evidence supports the lawlike 
hypothesis, is the relation involved between evidence and hypothesis characteri- 
zable at all as deductive or inductive, or is it some quite different, as yet unspeci- 
fied logical relation? %) These are some of the formulations that have been 
fashionable in fairly recent times. They are still the old questions dressed up in 
new clothes: though this may help by suggesting new hints for treatment. But 
in any case, the three questions listed above are after all closely related, and need 
joint investigation. Thus, you cannot ask ‘what does or can one mean by 
necessary connection’, without also asking how one would justify such a thing. 
Per contra, some celebrated philosophical versions (cf. the Kantian opus) have 
justified these concepts by showing that an analysis of their meaning could be 
obtained by bringing to light what was involved in thinking about science in 
general. 

So whilst our philosophical activity has certainly grown more like an apparently 
verbal enquiry in recent times, it is not really that. It is still the old task of 
discovering a meaning for, and a justification (or the impossibility of such a 
justification) of a basis. Let this be our excuse, then, in concluding these notes 
with a very brief account of what is still the most subtle and the most penetrating 
treatment which our problem has received by civilized man, namely in the hands 
of Hume and Kant. 

Hume (in his Treatise of Human Nature, 1739) postulated from the start 
that we do have this idea of causal, or necessary, connection. Its justification 
cannot lie among the relations as they exist between objects in the external world: 
this follows once more from their atomicity, and the consequent subjectivity or 
spontaneity of our ‘ connecting’ activity. We are forced to make, or at any 
rate, we do make, inferences,—certainly in Newton’s sense of spontaneous 
‘inductions’. But whence shall we collect the ‘ necessity’ of the connection 
that would guarantee the possibility of the inductive inference ? Hume’s exciting 
and revolutionary escape from this dilemma was that it is not the necessary 
connection that has to be supplied to justify the inference but rather that the 
making of the inference and the presence of the necessary connections are one 
and the same thing; the latter, instead of being a presupposition of the former, 
is necessarily and spontaneously given together with it. In other words, 
necessarily connecting is a universal characteristic of the inductively reasoning 
animal. (Treatise, i. 3. 14). 

Now this answer is surprisingly similar to the one found in Kant’s Critique 
of Pure Reason (1781). Kant simply sharpens the contrasts; he sees more clearly 
than his predecessors that the atomicity of ‘ideas’ is not a quasi-psychological 
fact but a philosophical postulate which is forced upon us since without 
it we cannot begin to talk of ‘knowledge’ at all—a surprising inversion of the 
old complaint! His argument was that if you consider the concept of relation 
as given together with the concept of the related (i.e. if both were viewed as 
‘ given from without ’), this would have the logical consequence of automatically 
introducing the sceptical doubt regarding the required necessitarian character 
of the relation in general; Kant’s hidden premiss here being that only what is 
contributed by the logical self (really, the logical forms of thought) is exempt 
from surprises; and if so contributed necessarily, then necessarily exempt! 
Making a virtue of the results already achieved by Locke and Hume, he hence 
considered the subjectivity of the concept of relation as the only guarantee of its 
being such as it presents itself; and the thing is deprived of its arbitrariness by 
showing that without the relation we could not speak of a scientific experience 
at all; not a very surprising conclusion since Kant has so contrived his analysis 
that what his sceptical doubt has taken away from the observational data, his 
definition of knowledge must subsequently put back again. 

Moreover, the ‘ necessitarian’ character of the concept of relation is made 
good also in a second sense by the argument that the character of the combination— 
being ‘ spontaneous ’—would be wholly ‘ subjective ’ unless we conceive it in a 
‘determinate way’. But that we so conceive it, is actually entailed by an analysis 
of the character of even the simplest experience: say of the content of our thought 
which asserts ‘ This is so’: or, ‘ This follows that’; or, ‘ This is simultaneous 
with that’. Thus, in the very conception of ‘ an objective state of affairs ’ (e.g. 
of a ship going down a stream) there is contained the thought that your ideas— 
antecedently conceived atomistically, and ‘ loose and separate ’—in the required 
act of combination must be considered as related in a determinate order; the 
model for this order being a ‘ rule’ or ‘ function’, the paradigm for which is 
once again the formula of mathematics. (As with Locke and Hume, this ‘ act 
of synthesis’ is, I think, a philosophical myth, which, when translated into a 
logical terminology, can be seen to yield some important results.) The specific 
form of such a principle expressing a ‘ determinate order’ would then depend 
upon the type of experimental proposition involved. Thus, the statement 
‘X is followed by Y’ carries the built-in categorical form expressed in the 
principle that ‘ everything that happens presupposes something which determines 
it in accordance with a rule’, the so-called principle of causation.‡ 

This was however a costly answer. Remember that we began with the 
complaint that whilst we could ‘ observe’ united qualities in the moment of 
presentational immediacy, we required a justification of the assumed feat of 
universal and necessitarian connection,—or better: of this concept of connection; 
a concept which entailed transcendence beyond the ‘specious present’. But 
now we find that in order to guarantee the absolute tightness of the argument to 
the ‘reality’ of the relation, the synthesis—Kant’s necessary synthesis—must 
take place, and is an absolute necessity, even in the moment of presentational 
immediacy! In that case, however, we are surely left once more without an 
answer to our earlier question, unless it can be shown that the concept of the 
‘universal’ involved in the case of the ‘ specious present ’ is identical with that 
needed by the supposed feat of ‘transcendence’. Now this is certainly not the 
case, nor—in justice to Kant—did he claim that it was; on the contrary he 
insisted that there was no such identity. 

‡ It will be noted that when thus stated, the principle is not inconsistent with the 
statistical aspect of contemporary quantum physics, since the form and nature of the ‘ rule’ 
is left open. It was an historical accident that Kant thought that the unique model for such 
a rule would be Newton’s laws of motion. The correct question hence becomes: what should 
be the preferred paradigms for causal action? In this way we can see some of the 
contemporary perplexities concerning causation and indeterminism might receive some 
enlightening re-interpretations. 

His solution was that the original problem of induction had been shown to 
evaporate under the heat of his own analysis. In its place he had put, on the one 
hand, the theory just sketched, which amounted to the proposition that there is a 
sense in which forms of scientific reasoning have no basis—unless the spontaneous 
generation of such forms on the part of the creative scientist be counted as such. 
On the other hand, he simply offered his readers the consideration that the 
discovery and verification of actual individual laws (as contrasted with the 
postulation of the concept of lawlikeness) is purely a matter of application of 
the methods of the scientists ; methods involving as a matter of fact the procedures 
of axiomatization and systematization, and the application of regulative ideas, 
which we met with in Section I—the whole thing becoming thereby, it will be 
seen, a slightly circular procedure. 

The passage of this episode in the history of the problem is fairly typical of 
other first rate philosophical problems, or of the same problem in different 
epochs; except that the Hume–Kant case-study offers us the supreme example 
both of magnificent brilliance of performance yielding novel insights for all time 
(achievements as well as failure being all the more obvious to us given our know- 
ledge of hindsight), and also a clear picture of the mode whereby a relentless 
pursuit of a problem when genuinely metaphysical comes to evaporate in a 
peculiar way. 

In the sequel one may discern one or two pronounced tendencies emerging 
from the Hume—Kant ruin. There is on the one hand a strong feeling that it 
isn’t so much that the problem of induction (and the related one of ‘ necessary 
connexion ’) is insoluble, but rather that it is bereft of any genuine significance. 
The most one will concede is that there is a definite meaning to the concept of 
law, adding perhaps, like the Wittgenstein of the Tractatus, that 

“If there were a law of causality, it might run: ‘There are laws of nature’. But
of course one can not say this: it shows itself. In the terminology of Hertz one might
say: Only lawlike connexions are thinkable.” (op. cit., 6.36-6.361).

It will be seen that this rather Kantian passage (written over 40 years ago) 
already contains the germ of the ‘ procedural ’ or ‘ work-in-progress ’ doctrines, 
assuming that we can make sense of the notion of lawlike form. On the other 
side, there are still many who persist in the less spectacular and somewhat tedious 
task of elucidating the principles and modes of a justification of induction, 
though deliberately avoiding for this purpose the concept of necessary connection, 
and largely operating with some version of the idea of ‘uniformity’. The
first group has a tendency to subside into more or less historical accounts of the
various forms of successful scientific argumentation; the second aims at
formalization and quantification of certain types of argument, and particularly 
emphasises the related problems of probability and statistical reasoning. 
Between these, is anything of philosophical (as contrasted with technical)
importance still left?




I think it is possible that by a renewed study of some of the significant themes 
and undertones among the episodes which have here come under review we may 
begin to perceive new trends, and new hints for proper ways to tackle some of the 
problems which we have sketched and which (it may be thought) have only been 
lost or temporarily abandoned. These have simply needed stripping of their 
contemporary and quite accidental formulations or fortuitous and encumbering 
preoccupations with matters no longer of interest in our own day. Thus we 
have seen that only a purely conceptual study has any relevance for the discussion 
of the philosophical basis of scientific reasoning though (as the conventionalist 
doctrine implied) we cannot sever this entirely from ‘procedural’ analyses.
Secondly, we must understand that the resulting problematic concepts of induc- 
tive inference, necessary connection, etc. emerge from subjective analyses of 
structures (e.g. laws) which must be regarded as embedded in or related to 
certain paradigm situations. This means that such analyses are not ‘God-given’,
but introduced by us in the capacity of autonomous logicians.

This notion of ‘ autonomy ’ which emerges is of course familiar to us already; 
for we found that it was the key-concept in the central arguments of the philoso- 
phers from Locke to Kant. It seems likely therefore that a systematic applica- 
tion of the notion might yield satisfactory formulations through which an alter- 
native to the conventionalist position could be envisaged; an alternative which 
would at least have the advantage of confirming the instinctive feeling of the 
scientist, that the forms in which he reasons (inductive and retroductive) are 
‘natural forms of thought’; that truth is the issue, and not just ‘ procedural 
rules’. 


REFERENCES 
(1) Buchdahl, Gerd, 1962, Descartes’ Anticipation of a ‘Logic of Scientific Discovery’,
in A. C. Crombie (ed.), The Structure of Scientific Change (forthcoming).
(2) Buchdahl, 1951, Science & Logic, Brit. J. Phil. Sci., 2, 217-235.
(3) Buchdahl, 1959, Sources of Scepticism in Atomic Theory, Brit. J. Phil. Sci., 10, 120-134.
(4) Buchdahl, 1960, Convention, Falsification & Induction, Aristot. Soc., Sup. Vol., 113-130.
(5) Popper, Karl, 1959, The Logic of Scientific Discovery, Chs. 1-5, 10.
(6) Ryle, Gilbert, 1957, in S. Koerner (ed.), Observation and Interpretation, 165-170.
(7) Buchdahl, 1956, Inductive Process & Inductive Inference, Aust. J. Phil., 34, 164-181.
(8) Buchdahl, 1961, The Image of Newton and Locke in the Age of Reason.
(9) Buchdahl, 1957, Science & Metaphysics, in D. Pears (ed.), The Nature of Metaphysics,
61-82.
(10) Toulmin, S. E., 1953, The Philosophy of Science, 3.
(11) Kneale, William, 1949, Probability and Induction, 17-18.
(12) Braithwaite, R. B., 1953, Scientific Explanation, 9.
(13) Waismann, F., 1951, ‘Verifiability’, in A. G. N. Flew (ed.), Essays in Logic and
Language, First Series, 116-144.
(14) Stebbing, L. S., 1930, A Modern Introduction to Logic, 21.
(15) Hanson, N. R., 1958, Patterns of Discovery, 2-4.


The Author: 

First studied engineering and, after some years spent as a structural design engineer,
he succumbed to his interest in general philosophical matters by acquiring graduate and
postgraduate qualifications in philosophy. Between 1948 and 1958 he built up the School of
History and Philosophy of Science in the University of Melbourne. He has been Visiting 
Lecturer in Philosophy of Science at Oxford University. At present he is Lecturer in 
Philosophy of Science, and in charge of History and Philosophy of Science and the 
Whipple Science Museum, University of Cambridge, as well as Assistant Lecturer in 
Moral Science, King’s College, Cambridge. 


The Science Masters’ Association Annual Meeting 


Imperial College, 2nd to 5th January, 1962 
by D. W. HARLOW 
Annual Meeting Secretary of the S.M.A. 


Over one thousand members of the Association attended the Annual Meeting 
of the Science Masters’ Association, held in the New Physics Building of 
Imperial College, London, from Tuesday 2nd January to Friday 5th January, 
1962. Once again the international aspect of this meeting was very pronounced, 
overseas visitors coming from Holland, Belgium, France, Northern and Southern 
Rhodesia. West German manufacturers joined in the various exhibitions. 

The Association was honoured on the Tuesday night by a reception by its 
President, Sir Patrick Linstead; five hundred guests being assembled in Ayrton 
Hall, Kensington. The main meeting commenced on the Wednesday morning 
with the opening of a Special Exhibition on the theme of ‘ Instrumentation’ by 
Denzil Freeth, M.P., Parliamentary Secretary to the Minister for Science. 
On the Wednesday evening, Sir Patrick Linstead, F.R.S., delivered his Presiden- 
tial Address entitled ‘ The Practice of Science’. This was immediately followed 
by the Association Dinner, this year attended by the record number of 600 
members and their ladies. The principal guest was Sir David Eccles, Minister 
of Education. 

The usual morning sessions for lectures were this year augmented by evening 
meetings and discussions; the members taking the opportunity of discussing a 
number of important matters covering the work and recommendations of a wide 
selection of the sub-committees through whose important work the main aim of 
the Association, to further the teaching of science, is constantly being pursued. 
Lectures included: 

(1) Driving the Atmosphere by Professor P. A. Sheppard, B.Sc., F.Inst.P., Professor of 
Meteorology; a consideration of the processes involved in the organisation of kinetic 
energy in the atmosphere. It was suggested that the schoolboy may be ready to 
glimpse and appreciate these processes because the laboratory is always around him. 

(2) Some Chemical Aspects of Plant Growth by Professor Helen K. Porter, D.Sc., F.R.S., 
Professor of Plant Physiology; a description was given of current views on the path- 
ways of biosynthesis of polysaccharides and the use made of radio-active carbon in 
their study. 

(3) Some Aspects of Human Communication by Professor Colin Cherry, D.Sc. (Eng.), 
A.M.I.E.E., Professor of Telecommunications; this talk surveyed some of the 
principal fields of human communication including the psychological aspects. 

(4) Experimental Problems in High Energy Nuclear Physics by N. C. Barford, M.A., B.Sc.,
Senior Lecturer in Physics; mention was made of the extremely high energies required 
to give the unstable nuclear particles an independent existence outside the atomic 
nucleus. Observation, at these energies, of particles which are both rare and decay 
with extremely short lifetimes, presents major problems to the experimental physicist. 
Methods of solving these problems in present-day experiments were clearly described. 

(5) New Optical Possibilities for Medical Endoscopy by Dr. H. H. Hopkins, D.Sc., F.Inst.P., 
Reader in Technical Optics; the possibility of making flexible tubes which will trans- 
port optical images from the interior of the body, using the new technique known as 
fibre optics was explored. A description was given of a novel form of conventional 
refracting optics and colour transparencies were shown taken with a new cystoscope
using the system.




(6) How can Television and Film be used in the Teaching of Science? by Dr. T. A. Margerison,
Science Editor of the Sunday Times; a challenging talk was given on the possibilities 
presented by television and film to assist in the spreading of up-to-date scientific 
knowledge. 


(7) The Physics of Rain and Snow by Professor B. J. Mason, D.Sc., Professor of Cloud 
Physics; this lecture was a review of recent advances in our knowledge of the physical 
processes involved in the formation of clouds and the development inside them of snow, 
rain, and hail. 


(8) Mathematics and Physics in Chemical Reactor Design by Professor K. G. Denbigh, D.Sc.,
Professor of Chemical Engineering Science; in any continuously operating reaction 
system there is a distribution of the times of passage of the individual molecules 
leading to problems in fluid mechanics and heat transfer, as well as chemistry. The 
methods used in chemical engineering to solve such problems were fully described. 


(9) The Problem of the Grey Seal by H. R. Hewer, O.B.E., M.Sc., Reader in Zoology ; the 
lecturer described some of the difficulties, their solutions and the results obtained in 
problems involving conservation of the Grey Seal. 


(10) Cosmic Rays and Space Physics by Professor H. Elliot, Ph.D., M.Sc., Professor of 
Physics; an explanation of the composition of the primary cosmic rays emphasising 
that being charged particles, they are deflected by magnetic fields and can therefore be 
used for investigating the magnetic fields in interplanetary space. An interesting 
account of the work being done along these lines at Imperial College was given. 


(11) Teaching Thermodynamics to Engineers by Professor D. B. Spalding, M.A., Ph.D., 
Professor of Heat Transfer; the need to teach thermodynamics in a ‘macroscopic’
instead of ‘microscopic’ manner and the resulting confusion in the minds of first-year
undergraduates. The lecturer discussed the manner in which these problems arose 
due to the present method of treatment in schools and suggested ways in which they 
may be avoided in the future. 


(12) The Problem of Noise by G. G. Parfitt, Ph.D., B.Sc., Lecturer in Acoustics; the 
importance of current research on these problems was emphasised, taking into account
the physical, psychological and social aspects of the problems. Up-to-date methods 
of noise control were outlined by the speaker. 


(13) Metals Science, Metals Technology and the Future by Professor J. G. Ball, B.Sc., F.I.M.,
Professor of Physical Metallurgy; the lecturer posed the problem ‘What is metallurgy?’
and gave several examples of the type of science the word conjured up in people’s
minds. The influence of the problems to be faced by metallurgists in the field of
nuclear power generation was explained and the possibility of using the resulting 
techniques in the non-metallic field was explored. 


(14) The Study of Insect Populations by Professor O. W. Richards, D.Sc., F.R.S., Professor 
of Zoology; the study of insect populations has grown into a science of its own with 
quite an extensive body of theory. The possibility that some of these theories may give 
us the power to predict population changes was explored. Several possible inaccuracies 
were mentioned but the value of the study was clearly shown to be at least a method of 
assessing the importance of natural and artificial controlling agencies. 


(15) Fluorine—the Neglected Halogen by Dr. A. J. Rudge, Imperial Chemical Industries 
Research Department, Winnington; the desirability of a ready source of information on 
fluorine in one single elementary text was raised; the possibility of the study of fluorine 
amongst the other halogens already studied in schools was proposed and the lecturer 
was able to demonstrate a simple laboratory fluorine cell. 


The opportunity for members to take part in discussions on the work of a 
number of sub-committees of the Association gave rise to some interesting 
sessions. Matters discussed were:— 


(1) The Draft Report of the Fourth Panel on Science in Sixth Form General Education. 
The discussion was initiated by the Chairman of the Panel, H. P. Ramage and general 
approval of the content of the suggested syllabus was evident. 




(2) New Look at Secondary Modern General Science, initiated by F. W. Kaye, B.Sc., 
Chairman of the Secondary Modern Committee. The committee presented their view 
of what should constitute a General Science scheme in keeping with the great advances 
made in our knowledge of science since the impact of nuclear power. Lively discussion 
arose concerning the ‘ dead wood’ to be excised from present orthodox syllabuses 
in order to make room for new topics. 

(3) The Work of the Modern Physical Science Committee initiated by the Chairman, John L. 
Lewis of Malvern College. This committee was set up at the beginning of 1961 to 
consider the teaching of modern physical science with particular reference to the new 
syllabuses recommended by the Science Masters’ Association. The Chairman ex- 
plained the progress already made by the Committee and outlined its future plans. 
One key problem, i.e. the devising of suitable experiments and the provision of suitable 
equipment, led to the inclusion in this discussion period, of examples of the apparatus 
at present being designed by both British and West German firms. 

(4) The Draft Report of the Committee studying The Training of Science Teachers, initiated 
by D. Scott of the Department of Education, Bristol. Recommendations made by 
the Committee covered such points as the desirability of all teachers of science under- 
going an effective training course; methods of Departmental assessment of students; 
periods of school practice and the possible extent to which Science Teaching Centres 
could be developed. 


An interesting feature of the last London Meeting was again afforded our 
members. Four simultaneous Receptions were given to Members on the evening 
of Thursday, 4th January. The Association is deeply grateful to the Royal 
Institute of Chemistry; the Institute of Physics and the Physical Society; the 
Institute of Biology and the London County Council Education Committee. 

The Members’ Exhibition was the largest ever held by the Association. 
A total of 60 members displayed 152 items, clearly demonstrating the fact 
that modern teachers of science have not lost the art of devising their own simple 
pieces of apparatus and experiments which are best suited to their particular 
pupils. London is always a popular centre for Manufacturers and Publishers 
who once again put on large displays of new equipment and books. Our members 
value this aspect of the Annual Meeting as they have the opportunity of discussing 
their own requirements with a number of firms and, at the same time, can discuss 
with numerous colleagues the pros and cons of particular pieces of apparatus and 
text books. A varied programme of visits and films was also arranged. 

Important matters concerning the future of the Association were discussed 
at the Business Meeting. The Committee were able to announce the appointment 
of E. W. Tapper as first full-time Secretary of the Association. Consequent 
upon this appointment a number of changes of Officers of the Association 
occurred, notably, the dropping of the office of Membership Secretary, to be
replaced by a Minuting Secretary and the disappearance of the post of Honorary 
General Secretary. A firm resolution regarding the future financing of Branch 
activities was taken and steps were put in hand to bring about the early formation 
of a single association open to all those concerned in the teaching of science. 
Discussion was also fruitful in the meeting of the Science and Religion Group, 
initiated by Gordon E. Barnes, M.A., on ‘ Science and Faith’. 

A Summary of the high-lights of this meeting would not be complete without 
mention of the tremendous contribution of the Organisations and Firms 
taking part in the Special Exhibitions on the theme of ‘ Instrumentation ’. It is 
significant that members are already asking “What is the theme for next year’s
Special?”.


BOOK REVIEWS 


Dissertations in Physics. Compiled by M. Lois Marckworth. (Stanford University
Press: London, Oxford University Press, 1961.) [Pp. 803.] 140s. 


This book provides a list of authors, titles, institutions and year of all doctoral dis- 
sertations in physics accepted by American universities up to and including 1959. Informa- 
tion is also given about availability of theses through interlibrary loan, or in published or 
microfilm form. The list comprises 8,414 theses, and there is a subject index with about 
six times that number of entries. The latter was compiled by business machine methods 
from the key words in the titles, but these would have to be more carefully chosen by authors 
for the index to be reliably useful in a search of the literature. 

The first American Ph.D. in Physics was given in 1861 at Yale. (The second, in 1863, 
was awarded by the same university to J. Willard Gibbs.) In contrast, it might be mentioned 
that the first D.Phil. in Physics at Oxford dates from 1923. 

A tabulation of this nature has much potential use as a reference book. The work 
successfully passed a test of accuracy applied by the reviewer in that correct reference was 
made to his own dissertation. However, as the pages were turned, one mis-spelling of 
an author’s name and an error in the date of a thesis were noted. W. E. Lamb, Jr.


Fundamentals of Modern Physics. By R. M. Eisberg. (John Wiley, 1961.) [Pp. xiii+729.]
84s. 


To start with the most important aspect: this is an excellent book and should be 
extremely useful for the teaching of modern physics to second-year undergraduates. The
book competes with such texts as those by Semat, Leighton, or Richtmyer, Kennard 
and Lauritsen. It is more advanced than Semat’s introductory volume, and not quite as 
advanced as Leighton’s. Its level is comparable to that of Richtmyer, Kennard and 
Lauritsen’s text, but the emphasis is more on the theoretical aspects than on the experi- 
mental data on which the theory is based. The level is sometimes slightly uneven, and 
although the author states (correctly, in this reviewer’s opinion) that only elementary physics
is needed as a background, the mathematical background needed seemed to be more than
‘intermediate calculus’. The selection of subject matter is nowadays pretty well standard
for a text of this kind, and there are no major surprises here, either by commission or
omission. The literature references seem to be insufficient, and from an educational point
of view, I should have thought that reference should have been made to at least the more 
important original papers, such as Rutherford’s 1911 paper. The bibliographies are without 
exception lists of other textbooks, and this reviewer feels that undergraduates at this stage 
of their education should start consulting if not original research papers, at least survey 
papers in such publications as Reports on Progress in Physics, Reviews of Modern Physics, 
Annual Review of Nuclear Physics, or for that matter Contemporary Physics. This 
blemish in an otherwise admirable set-up could, of course, easily be remedied in a future 
reprinting. 

After a rather short and insufficient introduction (the author is clearly impatient and 
wants to get on with the main job) the first chapter deals with the special theory of relativity. 
The next four chapters deal with the origin of quantum theory (energy density in the 
radiation field, the photo-electric effect, the Rutherford atom, and Bohr’s old quantum 
theory). Chapter 6 considers the particle-wave dualism and the uncertainty principle. 
Then follow two chapters on the Schroedinger equation, a chapter on perturbation theory, 
and one on the one-electron atom. Magnetic moments, spin, and relativistic effects are 
discussed in chapter 11. After a chapter on the symmetry effects occurring because of 
the indistinguishability of identical particles, the author discusses extensively the multi- 
electron atom and the periodic system. This reviewer rather deplored that some aspects 
were not discussed from the point of view of the old quantum theory : this gives rather a 
pictorial explanation of such effects as the quantum defect. Chapter 14 deals with X-rays, 
and is followed by a very welcome chapter on collision theory in which the transformation 
from the laboratory to the centre-of-mass system is discussed and also the Born
approximation and partial wave analysis. The book ends with a long chapter on the nucleus
which is pleasantly up-to-date, including a discussion of such topics as mesons and
parity non-conservation. The only topics which, considering the level the author maintains through
most of the book, one might have expected to find and which are absent are cosmic rays 
(only briefly mentioned) and at least a list of elementary particles and the present values of 
their masses and lifetimes. The author probably felt that these topics are changing so 
rapidly that they were better left out. 

The price of the book is high, but not impossibly so for the present day and age, and the 
book is beautifully produced and has an extensive index which should increase its usefulness 
even further. This reviewer certainly expects to profit greatly from it, and would like to 
compliment the author and the publisher on producing a valuable addition to textbooks at 
this level of undergraduate teaching. D. ter Haar.


An Introduction to Linear Network Analysis. By P. S. Farago. (English Universities
Press, 1961.) [Pp. xii+335.] 30s.


It is impossible to draw a hard and fast line between the fields of the physicist and the 
electrical engineer today. It is not surprising therefore to find a university physicist 
writing a book on Network Analysis. Only the first half of the book deals strictly with 
networks. The rest covers transmission lines (including waveguides) and thermionic valves
as linear circuit elements. There is no mention of transistors, which have certainly as much
claim today for inclusion in network theory as valves. On the whole the book is well written
and illustrated though a number of misprints were noticed (e.g. cos for cot on p. 153, x for A
on p. 154). In the past numerous books have been written on physics for electrical engineers.
This book might appropriately have been called Electrical Engineering for Physicists. As 
such it is fairly successful and may be recommended. However, the claim on the dust 
cover that it is suitable for honours students in light current electrical engineering is some- 
what optimistic. M. R. Gavin. 


On Teaching Voltaic Cells and Electrolysis. By H. C. Andrew. (Modern Science Memoirs,
No. 41. Published for the Science Masters’ Association by John Murray. London, 
OGTS) eps 2s; 


The title of this little booklet is misleading, as is its introduction. One is led to expect a 
discussion of the problems which face chemistry and physics teachers when they attempt to 
give reasonably satisfactory explanations of experimental facts found in electrolysis. 

However, the author takes pains to avoid all the difficulties: he describes some of the 
reactions and energy changes which occur in some common cells, and gives details of some 
simple and good experiments designed to show certain effects; for example one experiment 
demonstrates the removal of hydrogen ions around the cathode in a simple cell, and another 
shows that oxidation occurs at the anode. 

But a great deal is left untouched. No electrolysis processes other than those occurring 
in cells are considered, not even the electrolysis of dilute sulphuric acid with platinum 
electrodes, with its problem of what happens at the anode. Hydrogen over-voltage is 
mentioned but not discussed, there is no mention at all of electrode potentials, and the name 
of Faraday is not to be found. H. S. PICKERING. 


Physics for the Enquiring Mind. By Eric M. Rogers. (Princeton University Press and
Oxford University Press, 1961.) [Pp. x+778.] 48s.


An eight hundred page text, one might think, calls for a very enquiring mind in a student, 
so let it be said at once that the book is not to blame if the student does not at least develop 
one in the process of using it!

The book covers a course in physics designed for the non-scientist roughly equal in 
standard to that of an Advanced Level physics syllabus, though the content is not entirely 
the same. The author (a Cambridge graduate, who once taught in England) has selected
important blocks of material and built them into a connected framework. Thus there is
little statics, no geometrical optics, the work on heat is mercifully free of lumps of metal being 
transferred to cans of liquid, and a fifth of the book is taken up with a section on planetary 
astronomy, well grounded in observation. It is, however, very different in approach from
the usual Advanced Level text and demands serious cooperative thought from the student.
There is rarely any ‘ bookwork’ to be learnt by heart and triggered off, appropriately or 
inappropriately, by an examination question; instead the student is required to work 
through numbers of dissected problems, with heavy nudges, when necessary, at the crucial 
points—‘ the result is NOT zero’. Throughout the work clear distinctions are drawn 
between definitions, proofs, plausible assumptions and logical deductions from the
‘mathematical sausage-machine’. The risk of circular argument is pointed out when it occurs,
and the text is never guilty of it. Callendar and Barnes’ determination of J, for example,
is given a brief diagrammatic mention including the vital information that the current is 
measured in terms of force, and the potential ultimately by an induction method, so that 
when the voltmeter is introduced in the systematic development of electrical theory to 
measure ‘ energy-transfer-per-unit-charge ’, it can be calibrated by a heating coil experiment. 

Theory is introduced as it is needed—‘ Why not modernize further and describe all 
electrostatics in terms of free electrons? Simply because no experiment in this chapter shows 
any behaviour that necessitates electrons ’—and its relation to experiment and definition 
is handled in careful and thorough discussion. Slips are rare, but one might have some 
difficulty in reconciling the statement on page 107, “The relation we have announced—
resultant force ∝ mass · acceleration—is true”, with the subsequent carefully organized
development of Newton’s Second Law. More typically, the story of Galileo receives accurate
historical treatment and the sound judgment: “The principal mistake was the same on
both sides.... Galileo and the Church argued whether it was true.” Brownian motion
unfortunately starts with the widely prevalent error or ambiguity about what Brown actually
saw. The index is not quite as complete as it might be. These are minor slips in a book
remarkable for the clarity, consistency and honesty of its presentation. 

When one comes to the nuclear atom, the close-knit arguments of most of the course 
give way, for lack of mathematical tools, to description and analogy, which none the less 
demand critical thought, and should, appropriately, leave the student with ‘ doubts and 
ragged ends of unfinished knowledge ’. 

The most unusual feature of the book is its recognition of the part played by the visual 
imagination in understanding. Well over a thousand attractive diagrams, most of them 
small and simple line sketches, contribute powerfully to the text. They are very fully 
labelled in italic script and are frequently capable alone of conveying the idea involved. 
Financial help from various foundations has aided the publication of the book; both words 
and diagrams serve efficiently as vehicles of thought, partly because neither has had to be 
stinted. It is interesting, incidentally, to notice some of the ancestry of the P.S.S.C. course 
in this volume. This seems to be the more tightly-knit course of the two. Teachers of
physics could learn much from this valuable book—they might do well to acquire it before 
their students do so! SISTER ST JOAN OF ARC.



Textbook of Reactor Physics. By J. F. Hill. (George Allen & Unwin, 1962) allem 7a)
36s. 


Reactor physics is a borderline subject embracing some nuclear physics, some civil and 
mechanical engineering, some chemical engineering, some economics and so on. This is 
in addition to the specific features of reactor theory which must consider neutrons en masse 
and their diffusion through a variety of materials. The principles are rather simple but in 
practice the calculation of reactor properties very rapidly gets so complex that the largest
electronic computers are overtaxed. The present volume by the former Principal of the 
Harwell Reactor School can of course be considered a text book for reactor engineering 
students at an introductory level. However its greatest value may be for all those who are 
connected in some way with the reactor industry but are not directly concerned with reactor 
design. This book starts with a potted course in nuclear physics giving only those details 
pertinent to reactors. Then every aspect of reactor design and operation is described
clearly and succinctly so that the reader can understand what is involved and what points 
are critical. It should be most useful for those engaged in reactor construction, where they
may often be faced with seemingly ridiculous requirements by the reactor designer. A
very large expenditure of technical resources is being made on various power reactors
both here and overseas, and the more understanding of the problems involved by scientific
and technical ‘laymen’ the better. This book should be useful to them. E. B. Paul.


Electrons in Metals: A Short Guide to the Fermi Surface 


by J. M. ZIMAN 
University Lecturer in Physics, and Fellow of King’s College, Cambridge 


Part III. DYNAMICS OF BLOCH ELECTRONS, AND THE CALCULATION OF BAND
STRUCTURE


“Not in utter nakedness, nor in sheer forgetfulness
But trailing clouds of glory do we come”
—Wordsworth. 


1. THE NEED FOR ‘ THEORY’ 


In this part we shall consider two problems that were raised in I, and not 
resolved in II: how is it possible that sometimes the carriers in a metal behave 
as if they were positively charged, and how do we set about actually calculating 
the shape of the Fermi surface in practice? The first question is easily answered
by a piece of formal analysis; the interaction with the lattice can have a decisive 
effect on the dynamical properties of an electron. The second question requires 
an exposition of the technique of what is usually called ‘ The calculation of band 
structure’. It may seem that this is a somewhat specialized mathematical 
subject, of interest only to the professional theoretical physicist. So it seemed 
until recently. But the latest results in this field are of great value in giving us 
a physical insight into ‘what goes on’ in a metal. We can now, I think, see 
through the elaborate mathematical formulae, and understand instinctively why 
the electrons are not scattered much more strongly by the ions in a metal lattice, 
and why interaction between the electrons is also, in practice, relatively unim- 
portant. 

It is also worth remarking that a good deal of such theory preceded the 
discovery of experimental methods for studying the Fermi surface. The 
interpretation of the experiments is not easy, and the guidance of theory is 
essential. The most fruitful work in this field (as in most branches of physics)
has come from close co-operation between calculators and observers, between
theoretical and empirical science.


2. DYNAMICAL PROPERTIES OF ELECTRONS 


For the calculation of electrical conductivity, we need to know the velocity 
of the electrons on the Fermi surface. In a free-electron gas this presents no 
problem; the velocity is proportional to the momentum : 

$$ v = \frac{p}{m} = \frac{\hbar k}{m} \qquad (1) $$

The effect of Bragg reflections is to hinder the linear motion of the particle, so
that, on the average, its velocity may be much less than we should calculate from
(1). There is a very simple formula for this effect. In wave mechanics the
wave function is not constant, but has a time-dependent factor. An electron
in the state of energy ℰ(k) is supposed to have a ‘frequency’

$$ \nu(k) = \mathcal{E}(k)/h. \qquad (2) $$




This ‘state’ is thus a travelling wave system extending through the whole 
crystal. But to discuss the ‘ velocity’ we must partially localize the electron, 
and then watch how it moves. To do this we need to construct a wave-packet,
by superposing wave functions from different states in the neighbourhood of k 
(i.e. of slightly different wavelengths) in phase at our point of localization, and 
then we watch how this wave packet moves. 

We now have a familiar problem of diffraction theory. The formula (2) is 
essentially a dispersion law, relating the frequency of a wave to its wave number 
k. It is well known that the group velocity in such a medium is 

$$ v = 2\pi \frac{\partial \nu(k)}{\partial k} = \frac{1}{\hbar} \frac{\partial \mathcal{E}(k)}{\partial k} \qquad (3) $$

At least, this is the isotropic case, which is typical of most acoustic, hydrodynamic 
and optical media. More generally, however, we have to think of the possibility 
of somewhat anisotropic media, where the group velocity is not the same, for a 
given wave packet, in all three coordinate directions. In that case we treat each 
direction separately, and write 


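$$ (v_x, v_y, v_z) = \frac{1}{\hbar} \left( \frac{\partial \mathcal{E}(\mathbf{k})}{\partial k_x}, \frac{\partial \mathcal{E}(\mathbf{k})}{\partial k_y}, \frac{\partial \mathcal{E}(\mathbf{k})}{\partial k_z} \right) \qquad (4) $$
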
defining each component of velocity as equal to the derivative of the energy with 
respect to the corresponding component of the wave vector. In other words we 
treat ℰ(k) as a continuous function of position in k-space. The velocity of an
electron ‘in the state k’ is then simply the local gradient of ℰ(k) in that space:

$$ \mathbf{v}(\mathbf{k}) = \frac{1}{\hbar} \mathrm{grad}_{\mathbf{k}}\, \mathcal{E}(\mathbf{k}) \qquad (5) $$

This result is of great importance and interest. Once we have, by some
means, calculated or discovered the form of ℰ(k), the simple operation of taking
local derivatives gives the electron velocity. The whole effect of the crystal
lattice is contained in the function ℰ(k); there is no need to go back and look at
the actual form of the wave function. Moreover, the velocity of an electron is
always normal to the energy surface on which it lies. From the shape of the
Fermi surface we can calculate this direction at any point. For free electrons,
where the Fermi surface is a sphere, it is obvious that the velocity is always
parallel to the momentum. For ‘Bloch electrons’ (conduction electrons in
metals, where the propagation is seriously affected by the crystal lattice) this
surface may be seriously distorted, and v is not at all sure to be parallel to k.

Fig. 22. For a non-spherical Fermi surface, the velocity of an electron may not always be
parallel to its propagation vector.
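
The statement that v need not be parallel to k can be checked numerically. The following is only a sketch of equation (5): the anisotropic band `model_band`, its parameters and the choice of units (ħ = 1, m = 1) are assumptions made for illustration and do not come from the article.

```python
import math

HBAR = 1.0  # work in units where hbar = 1 (an assumption made for illustration)

def model_band(kx, ky, tx=1.0, ty=0.3):
    """Hypothetical anisotropic band E(k); form and parameters are not from the article."""
    return -2.0 * tx * math.cos(kx) - 2.0 * ty * math.cos(ky)

def free_band(kx, ky):
    """Free-electron band E = hbar^2 k^2 / 2m with m = 1."""
    return 0.5 * HBAR**2 * (kx * kx + ky * ky)

def velocity(band, kx, ky, h=1e-5):
    """Equation (5): v = (1/hbar) grad_k E(k), estimated by central differences."""
    vx = (band(kx + h, ky) - band(kx - h, ky)) / (2.0 * h * HBAR)
    vy = (band(kx, ky + h) - band(kx, ky - h)) / (2.0 * h * HBAR)
    return vx, vy

def angle_deg(u, w):
    """Angle in degrees between two 2-D vectors."""
    cosine = (u[0] * w[0] + u[1] * w[1]) / (math.hypot(*u) * math.hypot(*w))
    return math.degrees(math.acos(max(-1.0, min(1.0, cosine))))

k = (1.0, 1.0)
print("free electron angle:", angle_deg(k, velocity(free_band, *k)))
print("model band angle:   ", angle_deg(k, velocity(model_band, *k)))
```

For the free-electron band the angle between k and v is zero to within rounding; for the distorted model band the two vectors are clearly not parallel.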

To calculate the acceleration of an electron (i.e. of a wave-packet such as we
have considered) we need a more formidable analysis. The result may be quoted:
a force F, such as an electric field, changes the state of the electron in such a way
that

$$ \hbar \dot{\mathbf{k}} = \mathbf{F} \qquad (6) $$

This is a Newtonian law: the force equals the rate of change of crystal momentum.
It sounds plausible, so long as we are quite sure that ħk really is the
analogue of momentum for an electron in a Bloch state, but a good deal of hard
work has to be done to justify it exactly. It is now believed to be true, except in an
external field so enormous as to rival the local field around each metal ion.

In classical electrodynamics, the force on a moving electron in electric and
magnetic fields is given by

$$ \mathbf{F} = e \left( \mathbf{E} + \frac{1}{c}\, \mathbf{v} \times \mathbf{H} \right). \qquad (7) $$

Again, we assume this Lorentz force formula for our Bloch electron, and write

$$ \hbar \dot{\mathbf{k}} = e \mathbf{E} + \frac{e}{c}\, \mathbf{v}(\mathbf{k}) \times \mathbf{H}. \qquad (8) $$

The proof of this formula in a periodic potential is a major problem of quantum
theory, only recently solved. Fortunately there is overwhelming experimental
evidence that it is very exactly true in all ordinary fields.
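
One consequence of equation (8) is easy to verify numerically: with no electric field, the rate of change of k is perpendicular to v, so the electron moves along a curve of constant energy in k-space. The sketch below integrates (8) for a hypothetical two-dimensional band with H along z. The band, the constant `OMEGA` (standing for eH/ħc, with an arbitrarily chosen value and sign) and the Runge-Kutta step are all assumptions for illustration.

```python
import math

T = 1.0      # hypothetical band parameter (energy units, hbar = 1)
OMEGA = 1.0  # stands for eH/(hbar c); value and sign chosen arbitrarily

def energy(kx, ky):
    """Hypothetical model band, not taken from the article."""
    return -2.0 * T * (math.cos(kx) + math.cos(ky))

def velocity(kx, ky):
    """v = (1/hbar) grad_k E with hbar = 1."""
    return 2.0 * T * math.sin(kx), 2.0 * T * math.sin(ky)

def k_dot(kx, ky):
    """Equation (8) with E = 0 and H along z: dk/dt = OMEGA * (v x z_hat)."""
    vx, vy = velocity(kx, ky)
    return OMEGA * vy, -OMEGA * vx

def rk4_step(kx, ky, dt):
    """One fourth-order Runge-Kutta step for dk/dt = k_dot(k)."""
    a1, b1 = k_dot(kx, ky)
    a2, b2 = k_dot(kx + 0.5 * dt * a1, ky + 0.5 * dt * b1)
    a3, b3 = k_dot(kx + 0.5 * dt * a2, ky + 0.5 * dt * b2)
    a4, b4 = k_dot(kx + dt * a3, ky + dt * b3)
    return (kx + dt * (a1 + 2 * a2 + 2 * a3 + a4) / 6.0,
            ky + dt * (b1 + 2 * b2 + 2 * b3 + b4) / 6.0)

def orbit(kx, ky, dt=0.01, steps=2000):
    """Follow the state k(t) and return the list of points visited."""
    path = [(kx, ky)]
    for _ in range(steps):
        kx, ky = rk4_step(kx, ky, dt)
        path.append((kx, ky))
    return path

path = orbit(1.0, 0.5)
energies = [energy(kx, ky) for kx, ky in path]
print("energy drift along the orbit:", max(energies) - min(energies))
```

The k-vector travels a long way round its orbit while the energy stays fixed to within the integration error.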


Fig. 23. (a) Energy in a one-dimensional crystal. (b) Electron velocity in a one-dimensional crystal.




3. ‘ELECTRONS’ AND ‘HOLES’


The formulae of the previous section have a rather peculiar consequence. 
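Consider a one-dimensional crystal, for which ℰ(k), in a single band, takes the
form shown in fig. 23. Now apply an electric field such as to accelerate a free
electron to the right. For an electron near the bottom of the band everything
is quite simple. The value of k increases with time, the electron moves from a
state such as A to a state such as B, and its velocity increases. We could write
down a formula for the actual ‘acceleration’, i.e. the rate of change of velocity, by
combining (3) and (8):

$$ \dot{v} = \frac{\partial}{\partial t} \left( \frac{1}{\hbar} \frac{\partial \mathcal{E}(k)}{\partial k} \right) = \frac{\partial k}{\partial t}\, \frac{\partial}{\partial k} \left( \frac{1}{\hbar} \frac{\partial \mathcal{E}(k)}{\partial k} \right) = eE\, \frac{1}{\hbar^{2}} \frac{\partial^{2} \mathcal{E}(k)}{\partial k^{2}} \qquad (9) $$

It is as if the electron had the mass m*, where

$$ \frac{1}{m^{*}} = \frac{1}{\hbar^{2}} \frac{\partial^{2} \mathcal{E}(k)}{\partial k^{2}}. \qquad (10) $$
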


But near the top of the band the situation is quite different. An electron 
‘accelerated’ from C to D, say, finishes up with a smaller velocity than before 
the electric field was applied. The increase of k has brought it nearer to the
zone boundary, nearer to Bragg reflection, and has thus actually slowed it down.
The formulae (9) and (10) are still formally true, but the curvature of ℰ(k) is
now negative so that the ‘ effective mass’, m*, would be a negative quantity. 
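
The change of sign of the effective mass can be seen in a few lines. This is a sketch only: the cosine band below is a hypothetical stand-in for the curve of fig. 23 (a), the units (ħ = 1) are arbitrary, and the k-values attached to the labels A, B, C, D are illustrative guesses at where those states lie.

```python
import math

HBAR, T, A = 1.0, 1.0, 1.0  # hypothetical units and band parameters

def energy(k):
    """Model one-dimensional band E(k) = -2T cos(kA); not taken from the article."""
    return -2.0 * T * math.cos(k * A)

def velocity(k):
    """v = (1/hbar) dE/dk for the model band."""
    return 2.0 * T * A * math.sin(k * A) / HBAR

def inverse_effective_mass(k, h=1e-4):
    """Equation (10): 1/m* = (1/hbar^2) d2E/dk2, by a central second difference."""
    return (energy(k + h) - 2.0 * energy(k) + energy(k - h)) / (h * h * HBAR**2)

# A -> B lies near the bottom of the band, C -> D near the top (zone boundary at k = pi).
for label, k in [("A", 0.2), ("B", 0.4), ("C", 2.6), ("D", 2.8)]:
    print(label, round(velocity(k), 3), round(inverse_effective_mass(k), 3))
```

In this model the velocity rises from A to B, where 1/m* is positive, and falls from C to D, where 1/m* is negative, although k increases in both cases.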

From a mathematical point of view there is no difficulty about this, but it is 
inconvenient when we try to apply to such states our intuitive notions and 
experience of the dynamics of ordinary particles. The following trick† saves
us. Suppose we think about the properties of an empty state in an otherwise
full band. Such a state will behave as if it had minus the mass of the electron
that would have filled it. If the curvature of ℰ(k) is negative, then this ‘hole’
in the full band will behave like a normal Newtonian particle of positive mass.
But it will also have minus the charge of the electron that would have filled it,
so it will behave electrically as a positively charged particle.


Fig. 24. A band that is nearly full of ‘electrons’ may be described as a band containing
a few ‘holes’.

This analysis is so like the now-familiar Dirac theory of the positron that we
need scarcely pause to think it out in detail. To be consistent we should measure
the energy of a ‘hole’ negatively, that is, downwards from the top of the band.
We have replaced a ‘band nearly full of electrons’ by a ‘band containing a few
holes’, with a Fermi level, velocity, etc. clustered around the edge of the zone.
We can think of a ‘hole’ as an ionized atom (one that has lost an electron) in
a solid where otherwise the sites are occupied by simple neutral atoms. There is
a tendency for an electron from a neighbouring atom to fall into the ion, leaving
the original site ionized. Thus the ‘state of ionization’ can move through the
lattice like a positive charge. This motion can have its own kinetic energy, and
thus behaves like a dynamical particle. Where there are only a few holes in a
band, this description is much more economical than one where we discuss the
motion of all electrons into and out of the various sites.

† It is more than a trick: the most compact wave-packets are actually these ‘hole’ states.


Fig. 25. The motion of an electron to fill the vacant orbital is equivalent to the motion 
of a ‘ hole ’ carrying a positive charge. 


We can now understand some of the ‘ anomalous’ properties of metals which 
could not be explained inside the free-electron model. Consider, for example, 
the Hall effect. As we saw in I, § 5, the direction of the Hall field depends on 
the sign of the charge carriers. Positively-charged holes can thus be detected 
by their Hall field being opposite to that of ordinary electrons. This effect is well 
known in semiconductors, where the holes at the top of the valence band are very 
important as carriers in ‘ p-type’ material. 

But we can see the same effect in a divalent metal, where the zone is nearly 
full except for regions near the corners and overlaps through the zone faces (see 
fig. 20). The bits of Fermi surface in the overlaps will be electron-like, because 
ℰ(k) is near a local minimum, and thus has positive curvature in that neighbourhood.
The bits in the corners are best described in terms of holes, for ℰ(k) has
a maximum in each corner, and thus has negative curvature (i.e. negative 
effective mass). There are as many holes as electrons, so that such a metal would 
have negative or positive Hall coefficient according as electrons or holes contri- 
buted more to the electric current. To calculate this in detail is difficult, but we 
can certainly now understand in principle why the Hall constant is positive in 
Be, Zn and Cd, and small in Sn and Pb. 
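
The competition between electrons and holes can be made concrete with the standard low-field two-band expression for the Hall coefficient. This formula is an assumption brought in from textbook transport theory, not something derived in the article; it treats each group of carriers as isotropic, with a single mobility. The function name and the numerical values are hypothetical.

```python
E_CHARGE = 1.602176634e-19  # magnitude of the electron charge, in coulombs

def two_band_hall_coefficient(n, mu_e, p, mu_h):
    """Low-field Hall coefficient for n electrons and p holes per unit volume.

    Assumed textbook form:
        R_H = (p*mu_h**2 - n*mu_e**2) / (|e| * (p*mu_h + n*mu_e)**2)
    with mu_e and mu_h the (positive) electron and hole mobilities.
    """
    numerator = p * mu_h**2 - n * mu_e**2
    denominator = E_CHARGE * (p * mu_h + n * mu_e) ** 2
    return numerator / denominator

# Equal numbers of electrons and holes, as in the divalent metal discussed above.
carriers = 1.0e28  # hypothetical carrier density per cubic metre
print(two_band_hall_coefficient(carriers, 1.0e-3, carriers, 2.0e-3))  # holes more mobile
print(two_band_hall_coefficient(carriers, 2.0e-3, carriers, 1.0e-3))  # electrons more mobile
```

With equal numbers of carriers the sign of the result is decided entirely by which group is the more mobile, which is the sense in which the text speaks of electrons or holes contributing more to the current.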

The same argument applies, in a general way, to the thermoelectric effect. 
A current of holes will tend to give a positive thermoelectric power, as observed 
in some metals. But here the situation is really much more complicated, and
it does not follow that a positive thermoelectric power in a metal is a sign of the
predominance of ‘ holes ’ in the electrical conductivity. We shall return briefly 
to this point in IV. 

It must not be thought that the separation into electrons and holes is unique
and unambiguous. It is perfectly possible for ℰ(k) to have ‘saddle points’
where it is stationary but is neither a maximum nor a minimum. In such regions
the curvature of the surface may be positive in one direction and negative in
another. The dynamics of states in this region may be most unexpected. It is
a nice problem to calculate the laws of scattering of such an object in a central
field of force!

Fig. 26. The state with velocity v will behave like an ‘electron’ in the plane A, but like a
‘hole’ in the plane of B.


4. THE PROBLEM OF CALCULATING THE BAND STRUCTURE


So much for the in-principle problem of how it can be possible for apparently 
simple metals to show such anomalous behaviour as a positive Hall coefficient. 
But now we want to make a detailed quantitative study of each metal, and to 
derive numbers for its observed electrical properties. We want to know its 
band structure, or, for greater accuracy, the function ℰ(k) which fixes the shape
of its Fermi surface, electron velocities, etc. As we shall see in IV and V, there 
are empirical methods for determining the shape of a Fermi surface in detail, but 
we shall never be satisfied until we can also derive this shape by direct calculation 
from the known crystal structure and known potential of each atom or ion. In 
practice it is also almost essential to have some idea of the shape of Fermi surface 
from theory before one can disentangle the strong threads of evidence derived 
from such phenomena as the de Haas-van Alphen effect and the high-field 
magneto-resistance. 

In our one-dimensional model, we found a solution very simply, by considering
the effect of Bragg reflection from the regular lattice array. The result
was expressed in terms of one Fourier coefficient of the lattice potential. But
this was only an approximation, in which it was assumed that we could write
the wave function in the form

$$ \psi_k = \alpha \exp(ikx) + \beta \exp[i(k-g)x]. \qquad (11) $$

In fact the wave function must be much more complicated than this. Because
the ionic potential is very strong near the nucleus, the electron must have a high
kinetic energy there, to avoid being captured. It must, indeed, look very like
the wave function of a valence electron in the free atom, with, say, 2 or 3 nodes
inside the closed shells of the ion. A formula such as (11), combining just two
waves of wavelength greater than the lattice spacing, is quite inadequate to
represent such rapid oscillations. Our primitive solution cannot be generalized
to provide a reasonable means of solving the practical problem. It is only valid
when the effect of the lattice is a small perturbation; the actual field of the ions
is too strong.
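
For reference, the two-wave approximation (11) leads to a two-by-two eigenvalue problem whose solution can be written down directly. The sketch below assumes units in which ħ²/2m = 1 and the lattice spacing is 1. The value `V_G` of the Fourier coefficient is hypothetical and chosen small, since the approximation is only meant for a weak lattice potential.

```python
import math

HBAR2_OVER_2M = 1.0  # units with hbar^2 / 2m = 1 (an assumption)
G = 2.0 * math.pi    # reciprocal lattice vector for lattice spacing a = 1
V_G = 0.5            # hypothetical Fourier coefficient of the lattice potential

def free_electron_energy(k):
    """Kinetic energy of a plane wave exp(ikx)."""
    return HBAR2_OVER_2M * k * k

def two_wave_energies(k, v_g=V_G, g=G):
    """Eigenvalues of [[e1, v_g], [v_g, e2]], where e1 and e2 are the
    free-electron energies of the two waves exp(ikx) and exp(i(k-g)x)."""
    e1 = free_electron_energy(k)
    e2 = free_electron_energy(k - g)
    mean = 0.5 * (e1 + e2)
    half_split = math.sqrt((0.5 * (e1 - e2)) ** 2 + v_g ** 2)
    return mean - half_split, mean + half_split

lower, upper = two_wave_energies(G / 2.0)
print("gap at the zone boundary:", upper - lower)
print("lower band well inside the zone:", two_wave_energies(0.1)[0])
```

At the zone boundary k = g/2 the two levels are separated by twice the Fourier coefficient, while well inside the zone the lower level is very nearly the free-electron energy.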


Fig. 27. Typical wave-function of an electron in a crystal. 


5. TIGHT-BINDING AND CELLULAR METHODS


Well then, let us exploit our expectation that inside each ion the wave function 
must be rather similar to the wave function of an electron in the free atom. 
We can, for example, construct a whole set of these functions, 3s, 3p, 3d, 4s, 4p 
... etc., around each ion in the lattice. Then we can take linear combinations
of these, with appropriately wavy factors, and look to see how they satisfy the
Schrödinger equation in the ionic potential. From a formal point of view, this
is quite easy, and physically speaking it is the wave-mechanical analogue of our 
original argument for free electrons. We are simply setting up the problem on 
the assumption that an electron ‘ hops’ from ion to ion, and is eventually shared 
out amongst all the ions of the crystal. 

This method is called the tight-binding method because it starts with each
electron tightly bound in its own atom, and then allows the bonds to be loosened
by interaction with neighbours. We can see the effect in a schematic way if we
take an array of atoms spaced far apart, and gradually bring them together so
that the interaction between neighbours increases. We find (fig. 28) that each
discrete level of the free atom tends to give rise to a whole band of states, the
width of the band broadening as the atoms come closer and interact more strongly.
The 3s-state in each of N atoms is two-fold degenerate because of spin. When
the atoms are brought together, these 2N states combine and spread apart in
energy to make a band, which we naturally call the 3s-band. Thus, we can
identify the various energy bands in a metal by the atomic levels from which they
arose.

Fig. 28. States of the free atom combining to form bands.
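
The broadening of a single atomic level into a band can be illustrated with the simplest possible model: a ring of N identical atoms, each with one level of energy `e0`, coupled to its two neighbours by a ‘hopping’ term `t`. This model and all its names are assumptions made for illustration; the article does not specify any particular Hamiltonian. The sketch checks that the wavy combinations exp(ikj) of the atomic states are exact solutions, and that the band width grows with the strength of the interaction.

```python
import cmath
import math

def apply_hamiltonian(psi, e0, t):
    """Act on amplitudes psi[j] (one per atom on a ring) with the model Hamiltonian:
    level e0 on each atom, coupling -t to each of the two neighbours."""
    n = len(psi)
    return [e0 * psi[j] - t * (psi[(j - 1) % n] + psi[(j + 1) % n]) for j in range(n)]

def bloch_state(m, n):
    """Linear combination of atomic states with the 'wavy' factor exp(i k j), k = 2 pi m / n."""
    k = 2.0 * math.pi * m / n
    return [cmath.exp(1j * k * j) for j in range(n)]

def band_energy(m, n, e0, t):
    """Energy of the m-th Bloch state of the ring."""
    return e0 - 2.0 * t * math.cos(2.0 * math.pi * m / n)

def band_width(n, e0, t):
    """Spread in energy of the n orbital states (2n states when spin is counted)."""
    levels = [band_energy(m, n, e0, t) for m in range(n)]
    return max(levels) - min(levels)

N = 8
for t in (0.0, 0.5, 1.0):  # t = 0 corresponds to atoms spaced far apart
    print("hopping", t, "band width", band_width(N, 0.0, t))
```

With `t = 0` all N orbital states share the free-atom energy; as `t` increases they spread into a band of width 4t in this model.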

This principle is exemplified in the transition metals. The electrons in the
d-levels combine to form a ‘d-band’, which is more or less distinct in properties
from the band formed by the outer s-electrons (although they may overlap in
energy). If this d-band is not full it will contribute to the electrical properties
of the metal, but in quite a different way from the s-band, where the electrons
behave as if nearly free. The d-wave functions are drawn rather compactly
about each atom, so that they do not interact strongly with their partners on
neighbouring atoms. The d-band is thus rather narrow, and yet may contain
a high density of electrons. The velocity of the d-electrons must be very 
small, so that they do not carry much electric current, and so on. But here 
we enter a controversial field. The electrical structure and properties of the 
transition metals are not yet understood. 

Unfortunately the tight-binding method is not satisfactory as a practical
means of calculating the band structure. The wave function is well represented
inside the ion core (i.e. inside the closed shells), but the method cannot deal with 
the large proportion of the volume of the metal between the ions. In these 
interstitial regions the potential is nearly constant, the kinetic energy of the elec- 
tron relatively small (e.g. the Fermi energy) and the wave function rather smooth. 
Atomic orbital wave functions die off exponentially outside their atoms; this is 
too drastic. 

The situation can be saved to some extent by constructing cellular wave 
functions. We draw a unit cell around each atom, and then solve the Schrödinger
equation under new conditions, for example that the wave function in each
cell should join smoothly on to the wave function in the neighbouring cell (in
practice, on to itself again with a phase factor). These boundary conditions can
be made explicit, and the problem can be set up and solved systematically. The 
results are quite good in practice, but the problem tends to be ‘ over-determined ’ 
—the conditions for smoothness of join across the whole plane separating two 
neighbouring cells are an infinite set, so that it is difficult to choose from them a 
small number of the most significant relations to determine the coefficients in 
the linear combinations of solutions. The computational difficulty emphasizes 
the weakness of the method in principle. We have introduced an artificial 
boundary in the region where the potential is actually most nearly constant. 


6. MUFFIN-TIN POTENTIALS, APW’s AND OPW’s 


We can come closer to the real physical situation if, instead of drawing
artificial boundaries between the cells of the crystal, we deliberately treat the
region between the ions as empty space in which the electrons propagate as
simple plane waves. Around each ion we draw a sphere, as small as we dare,
inside which the potential is spherically symmetrical and rather like the potential
in a free atom. In the interstitial region the potential is constant. The potential
map then looks like a cake-, cookie-, or muffin-tin (according to one’s
continent), with each circle-pattern the site of an ion.


Fig. 29. Muffin-tin potentials. 


There are various ways of solving for the electron wave-functions in such a 
system. In the method of augmented plane waves (APW’s) one tries to match 
travelling plane waves in the interstitial region to sets of spherical harmonics and 
radial functions satisfying the Schrödinger equation inside each atomic sphere.
The matching conditions are now defined over the spherical boundary surfaces,
and are therefore much simpler than for a polyhedral unit cell. In Korringa’s
method, or the method of Kohn and Rostoker, each ion sphere is treated as a source
of scattering for the plane waves in the interstitial space, and one looks then for 
a combination of such plane waves that will stay steady, self-consistently, for 
this scattering. The ‘ best ’ such combinations can be chosen by the application 
of a variational formula. 

In the hands of experts, both these methods can be made to yield good 
results, consistent with one another and with experiment. The mathematical
formalisms are complicated, but the separation of the volume of the crystal into 
two regions, with appropriate representations of the wave function in each 
region, is rather nice in principle. This is something like the physical reality. 

But there is another extremely elegant method, which is both convenient 
in use and illuminating in principle. It is a little more subtle than those methods 
which we have discussed, and therefore needs more careful explanation. We 
recall a fundamental theorem of wave mechanics, which states that different
solutions of the Schrödinger equation must be orthogonal† to one another.

Our conduction electron must be in a state which is orthogonal to all the
states of the electrons in the closed shells of the ions, otherwise the conditions
of the exclusion principle would not be properly satisfied. Herring suggested
(in 1940) that one ought to make sure that the wave function had this property
from the very beginning before trying to make it satisfy the Schrödinger equation.
He proposed a neat technical trick for this. One takes a plane wave with a definite
k-vector, exp(ik·r), and one combines it with an appropriate mixture of wave-like
states constructed out of the ion ‘core’ wave functions, the mixture being
chosen so as to be exactly orthogonal to just these core states. This new function
is called an orthogonalized plane wave (OPW). Then we write down the lattice
potential (essentially of muffin-tin form), and calculate what combinations of
OPW’s (i.e. corresponding to different values of k) best satisfy the Schrödinger
equation.

† This is the mathematical counterpart of the word ‘different’ or ‘independent’ when
applied to electron wave functions. If ψ₁ and ψ₂ are truly different solutions of the
Schrödinger equation, each capable of being separately occupied by an electron, then the
integral of the product ψ₁ψ₂ over space must vanish. In practice it means that the different
wave functions must have different numbers, or different arrangements, of nodes.

This all sounds most artificial—but it turns out that only a very few different 
OPW’s are needed to get quite close to the answer. The point is that each OPW 
is a very good representation of the sort of wave-function that an electron must 
have in a metal. The ‘core’ wave functions are rather compact functions, which
vanish outside the closed shells of the ion. Thus, in the interstitial region an
OPW looks very like a simple plane wave. But in the core it can have plenty of 
oscillations and enough nodes to look very like the wave function for the next 
atomic state above the closed shells. 
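
The construction of an OPW can be sketched in one dimension. The example below is a deliberately crude stand-in: a single ion at the origin, one ‘core’ function modelled as a normalized Gaussian of hypothetical width `SIGMA`, and integrals replaced by sums on a grid. None of these choices comes from the article, and a real calculation would use Bloch sums of true core orbitals over all the ions.

```python
import cmath
import math

def inner(f, g, dx):
    """Discrete approximation to the integral of conj(f) * g over the grid."""
    return sum(complex(a).conjugate() * b for a, b in zip(f, g)) * dx

def normalized(f, dx):
    """Scale f so that inner(f, f, dx) equals 1."""
    norm = math.sqrt(inner(f, f, dx).real)
    return [value / norm for value in f]

def orthogonalized_plane_wave(k, xs, cores, dx):
    """Return (plane wave, OPW): the plane wave exp(ikx) and the same wave with its
    projection on each core function removed. The cores must be orthonormal."""
    plane = [cmath.exp(1j * k * x) for x in xs]
    opw = list(plane)
    for core in cores:
        overlap = inner(core, plane, dx)
        opw = [o - overlap * c for o, c in zip(opw, core)]
    return plane, opw

DX = 0.01
XS = [-10.0 + i * DX for i in range(2001)]
SIGMA = 0.3  # hypothetical width of the Gaussian stand-in for a core state
CORE = normalized([math.exp(-x * x / (2.0 * SIGMA ** 2)) for x in XS], DX)

PLANE, OPW = orthogonalized_plane_wave(1.5, XS, [CORE], DX)
print("overlap of OPW with the core state:", abs(inner(CORE, OPW, DX)))
print("difference from the plane wave at x = 5:", abs(OPW[1500] - PLANE[1500]))
print("difference from the plane wave at x = 0:", abs(OPW[1000] - PLANE[1000]))
```

The OPW is orthogonal to the core function by construction, is indistinguishable from the plane wave far from the ion, and differs from it strongly inside the core.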


(c) OPW = plane wave − core function


Fig. 30. Synthesis of an orthogonalized plane wave. 




7. THE EFFECTIVE POTENTIAL


Out of all these calculations there arises a rather remarkable result. In the 
end, the conduction electrons look much more free than we had any reason to 
expect when we started on the problem. The ionic potential is a very strong 
local potential, with a singularity like Ze²/r near the nucleus. The Fourier
component of this potential at a wavelength equal to the lattice spacing, must be 
many electron volts, so that we should expect very large energy gaps and strongly 
distorted Fermi surfaces. All the divalent metals ought, one feels, to have been 
semiconductors. 

Yet in many cases the calculated energy surfaces are not very different from 
spheres, and the energy gaps are usually only one or two electron volts. In 
Al, for example, Heine calculated that a single OPW whose energy is nearly 
the same as that of a free-electron of the same k-vector, was a good representation 
of the solution of the Schrödinger equation, except at points very near a zone
boundary. From an energy point of view, the free electron model is much better 
in practice than it appeared at first sight. 

We are now beginning to understand the reason for this fortunate circum- 
stance. Calculations of band structure are not simply to find any solution for 
the wave function of an electron in a periodic lattice. There are plenty of these 
solutions, corresponding to bound states in the ion cores. But these lower 
states are all filled. We are looking for the ‘next’ band of states above them.
These ‘conduction’ states must be orthogonal to all the core states, and this puts
a serious constraint on the form of the wave-function. It means that the wave
function of the conduction electron must oscillate rapidly inside the core, and
therefore have a rather high kinetic energy. The kinetic energy is enough to
balance out most of the potential energy of the core region, so that there is not 
really a strong barrier to the electron entering or leaving each atomic sphere. 
This means that each ion does not scatter the conduction electrons very strongly, 
so that the scattering power of the atomic planes is rather weak. 


Fig. 31. An OPW in the true potential (a) can be replaced by a simple plane wave in the
effective potential (b).




This argument can be made more or less quantitative. In the OPW
method we constructed the final solution out of combinations of OPW’s, each of
which was a plane wave with orthogonalized wiggles in the ion cores. The rules
for building up each OPW involve various integrals of wave functions in the
core, etc. But these rules can be transformed until, in the end, they can be
made to have the same effects as an extra energy term in an ordinary Schrödinger
equation. Our problem becomes equivalent to constructing a solution for this
modified Schrödinger equation out of simple plane waves.

We have made the orthogonality condition look like a potential. The 
electron resents the high kinetic energy that it must acquire to enter the core, 
so that this pseudo-potential is obviously repulsive. But the ordinary ion poten- 
tial is attractive so that these two fields tend somewhat to balance out. The net 
effective potential is thus quite small—just a few electron volts. For such a 
small potential the solution of the modified Schrödinger equation is very simple;
the nearly free-electron method of II § 3 is adequate as a first approximation. 

This argument is not rigorous and does not, perhaps, offer the best technical 
procedure for a precise calculation of band structure. But it shows quite clearly 
why the free-electron model works so well as a first approximation. We must 
not insist that the wave function of a conduction electron is just a simple plane 
wave. If we do that, the deep potential well at the nucleus of each ion will have
an enormous effect on the energy. We must allow the wave function to oscillate 
rapidly whenever the electron falls into such a well so that it can nearly compensate 
the local negative potential with a large positive kinetic-energy term. This 
compensation can be so good that the energy of the electron in the lattice is very 
much the same as if it were perfectly free. The remaining coherent scattering 
by the lattice planes can be dealt with as a relatively small perturbation. 


8. SCREENING 

One general formal problem remains. What atomic potential should we use 
in our band-structure calculation? How can we allow for the fact that our 
array of ions is bathed in a dense gas of electrons, which strongly interact with 
each other and with the lattice? This is a many-body problem, for the electrons 
are identical particles and must be treated collectively, all 10²³ of them. It has
been a challenge to theoretical physics for a generation, and has not been solved 
completely. But we now think we understand the major phenomena for a free- 
electron gas (in jellium, of course) even if we have not yet succeeded in putting 
such a gas into a real lattice of metallic ions. 

Happily, our intuitive notions are being confirmed. The idea of a Fermi 
surface, with the classification of states by their k-vectors is still valid. The 
low-energy excitations of the whole system can be described by a scheme which
is equivalent to statements like ‘the change of energy when we take an electron
out of the state k, below the Fermi surface, and put it into k′, above the Fermi
surface, is given by’

$$ \delta\mathcal{E} = \mathcal{E}(\mathbf{k}') - \mathcal{E}(\mathbf{k}). \qquad (12) $$

Or we can say ‘the change of momentum of the whole gas in this transition is
exactly ħk′ − ħk.’ Or ‘the velocity of the excitation is v(k) = (1/ħ) grad_k ℰ(k).’
We can go on thinking of the electrons as nearly independent particles each with
charge e. The only effect of the strong Coulomb interaction between them is to
modify the form of ℰ(k), which is now not quite equal to ħ²k²/2m.




The reason for this is that the electron gas itself acts as a very efficient electrical
screen. Put a point charge into the gas, say a negative charge, −Z|e|.
This will repel electrons. As they move with high velocity (remember the
Fermi energy!) past the charge, they will be deflected from it. But there is a
background jelly of positive charge. In the region round our foreign body the
electron gas is now not sufficiently dense to neutralize the positive jelly. In fact,
since our system must be neutral over-all, Z electrons will be driven away, and
in the neighbourhood of −Z|e| the total excess positive charge on the jelly
will be exactly +Z|e|. Looked at from afar, the field of our foreign charge
is almost exactly compensated by this positively charged cavity in the electron
gas.

Fig. 32. Excitation of a Fermi gas.

Fig. 33. A negative charge screened by an electron gas.

To calculate this effect in detail, in a self-consistent way, is a formidable
problem. But one can show that the field of the foreign charge will be almost
negligible at a distance of a few lattice spacings. Instead of −Z|e|/r, we shall
observe something like

$$ -\frac{Z|e|}{r} \exp(-r/l) \qquad (13) $$

where l, the screening length, is of the order of the interatomic distance.
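
To get a feeling for how quickly expression (13) dies away, one can compare it with the bare Coulomb form at a few distances. The units (|e| = 1, lengths measured in screening lengths) and the function names below are assumptions made for illustration.

```python
import math

def bare_potential(r, z=1.0, e=1.0):
    """Unscreened form -Z|e|/r for the foreign charge, in units chosen for illustration."""
    return -z * abs(e) / r

def screened_potential(r, screening_length, z=1.0, e=1.0):
    """Expression (13): the bare form multiplied by exp(-r / l)."""
    return bare_potential(r, z, e) * math.exp(-r / screening_length)

L_SCREEN = 1.0  # screening length, taken as the unit of distance
for r in (0.5, 1.0, 2.0, 5.0):
    ratio = screened_potential(r, L_SCREEN) / bare_potential(r)
    print("r =", r, " screened/bare =", ratio)
```

At five screening lengths the field has fallen to less than one per cent of its unscreened value, which is the sense in which it is ‘almost negligible at a distance of a few lattice spacings’.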

This argument holds for any charge immersed in a gas of free electrons. 
We can apply it to the ions of the crystal lattice, and suppose that the strong 
Coulomb field of each ion is screened by the gas of conduction electrons. The
potential seen by any one electron moving through the lattice is thus not nearly
as great as we should calculate on the basis of ‘bare’ ions alone. Here is another
reason why the actual effects of the lattice on the electrons (the energy gaps etc.)
are much smaller than we thought at first.




One can even apply the same analysis to each conduction electron, as if it, 
too, were a foreign body largely screened by the electron gas in which it is
immersed. Each electron is surrounded by a shadowy sphere, into which other 
electrons do not easily penetrate. The positive charge of the jelly in this sphere 
equals the charge on the electron, so that the whole object looks electrically 
neutral. It moves through the volume with nearly the same kinetic energy as 
a bare electron. There is just the small correction to the free-electron formula 
because of the difficulty of keeping ‘ other’ electrons out of the shadow region 
when our chosen electron is moving rapidly. The interaction between two such 
excitations is now quite small. It will be a residual electrostatic field like (13), 
which can only act at distances of the order of the screening length l. The
powerful, long-range Coulomb interaction has been cut down to a very local 
interaction that can easily be treated by perturbation methods and is not very 
important. 

Nevertheless, after all this, the choice of atomic potential is still very uncertain. 
We have not discussed at all the exchange effects, associated with the special 
interactions between electrons of the same spin. All these band-structure 
methods, for all the mathematical ingenuity and hours of computer time that have 
been spent on them, have not really succeeded in giving us good a priori know- 
ledge of the electronic structure of metals. When I have said that a method is 
good, I have meant that its results have turned out to be in good agreement with 
an already existing experimental determination of the shape of a Fermi surface. 
The methods for such determinations will be discussed in IV and V. 


Editor’s Note:
Offprints of Dr. Ziman’s articles on Electrons in Metals will be bound together and
published in booklet form on the completion of the series. They will be available from
the Publishers early in 1963.


The Metre 


by H. BARRELL 
The National Physical Laboratory, Teddington 


1. INTRODUCTION 


On 14 October 1960 the Eleventh General Conference of Weights and 
Measures assembled in Paris unanimously agreed to a fundamental change in 
the way the metre is defined. The platinum-iridium bar known as the Inter- 
national Prototype Metre was thus deposed from the supremacy it had held in 
the field of length measurements since 1889 and in its stead the wavelength of 
light was established as the fundamental standard of length. In the new 
definition the metre is declared equal to 1 650 763.73 wavelengths in vacuo of
the orange line (λ = 6058 Å, spectroscopic designation 2p₁₀–5d₅) emitted by the
krypton atom of mass 86.
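
The wavelength implied by this definition follows directly from the stated count of wavelengths per metre. The following Python sketch performs that single division; the function and variable names are illustrative and do not come from the article.

```python
# Wavelength implied by the 1960 definition of the metre.
# The count below is the figure quoted in the text for the krypton-86 orange line.
WAVELENGTHS_PER_METRE = 1_650_763.73

def wavelength_in_angstrom(count_per_metre: float) -> float:
    """Return the wavelength in angstroms (1 angstrom = 1e-10 m)."""
    wavelength_m = 1.0 / count_per_metre
    return wavelength_m * 1e10

krypton_wavelength = wavelength_in_angstrom(WAVELENGTHS_PER_METRE)
print(f"Krypton-86 orange line: {krypton_wavelength:.2f} angstrom")
```

Rounded to four significant figures the result agrees with the 6058 Å quoted above.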

One of the fascinating aspects of the metric system of weights and measures 
as originally founded in France during the Revolution was the creation of the 
metre unit of length derived from a particular dimension of the Earth. By 
specifying a unit of measurement equal to a ten-millionth part of the quadrant 
of a meridian of the Earth it was intended to endue the metre with qualities 
of permanence and invariability as durable as the Earth itself was considered to 
be. This was no doubt the justification for the temporal reference in the metric
slogan “À tous les temps, à tous les peuples”.

At the same time it was decided to construct a material representation of the 
metre unit in the form of an end-standard made of a platinum bar on which this 
distance was equal to the separation of its nominally flat and parallel ends when 
the bar was at the temperature of melting ice (0°C). This material standard of the
metre was deposited (with a platinum kilogramme) in the archives of the Republic
and is accordingly known as the Mètre des Archives. Apart from its com-
memorative aspect the Mètre des Archives was also regarded as a primary
standard which it was intended to reproduce, if damaged or destroyed, by 
reference to the length of a simple pendulum beating seconds at a given latitude 
and altitude, rather than to the natural standard of the Earth’s dimension. 
Much effort was indeed put into determining the relation between the length of 
the seconds pendulum and the metre, but the proposed procedure for reinstate- 
ment was never used. 

When, in 1889, the more precisely defined International Prototype Metre 
became established as the means of defining the metre, the greatest care was 
taken to reproduce as closely as possible the length of the Mètre des Archives
and not that depending on the size of the quadrant of the meridian. Even so, 
the transfer was a difficult operation involving comparison between two differing 
kinds of length standard, the new Metre being a platinum-iridium line-standard 
on which the unit was represented by the distance, at 0°C, between two parallel
defining lines when the bar was supported horizontally in a specified manner. 

Less than 30 years had elapsed since the Mètre des Archives had been legally
established when the first suggestion was made, during the period when the wave 




theory of light was being developed and experimentally verified, that the wave- 
length of light would provide a natural standard of length. After the lapse of 
a further 130 years or so, the epoch-making decision has now been taken adopting 
the wavelength of a well defined atomic emission line as the fundamental basis 
of length measurement. It is therefore of interest to review the scientific history 
of the metre, tracing the advances which have been made since the metre was 
derived from a terrestrial base to an accuracy of 1 part in 5000, through the period 
during which material standards were used to maintain the unit, ultimately 
with an accuracy of 2 parts in 10⁷, to the present epoch when the metre is
reproducible through an atomic standard to 1 part in 10⁸.
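
The three accuracy figures quoted above are easier to compare when each is converted into an absolute uncertainty on a length of one metre. The Python sketch below does only that arithmetic; the dictionary labels and the function name are illustrative choices, not terms used in the article.

```python
# Relative accuracies quoted in the text for the three epochs of the metre.
RELATIVE_ACCURACY = {
    "terrestrial base": 1 / 5000,
    "material standards": 2e-7,
    "atomic standard": 1e-8,
}

def uncertainty_in_micrometres(relative_accuracy: float, length_m: float = 1.0) -> float:
    """Absolute uncertainty, in micrometres, on a length given in metres."""
    return relative_accuracy * length_m * 1e6

uncertainties = {
    epoch: uncertainty_in_micrometres(rel) for epoch, rel in RELATIVE_ACCURACY.items()
}

for epoch, value in uncertainties.items():
    print(f"{epoch}: about {value:g} micrometres on one metre")
```

On this reckoning the uncertainty falls from about 0.2 mm to about 0.01 µm, a factor of 20 000.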


2. ESTABLISHMENT OF THE METRE 


The metric system of weights and measures† was devised in France during
the Revolution to overcome the confusion due to incoherency and multiplicity 
of the measures in use. The extreme confusion in France at that time contrasted 
markedly with the situation in England, where strong central government and 
mercantile development had imposed and maintained a unified system of weights 
and measures from the time of Edward III at least and probably from an earlier 
period. 

In recommendations made to the Constituent Assembly in 1790, Talleyrand, 
Bishop of Autun, proposed the creation of a new system based on a natural 
standard depending on the length of a simple pendulum beating seconds at 
latitude 45°. He also suggested collaboration between the Paris Academy of 
Sciences and the Royal Society of London in experiments designed to realize 
this concept, which was concurrently being considered as a means of realizing a 
standard length in Britain (by Sir John Miller) and also in the U.S. (by Jefferson). 

Later in 1790 an Academy commission appointed to consider scientific 
aspects of a unified weights and measures system reported in favour of using 
the decimal scale for deriving multiples and submultiples of weights and measures 
as well as of coinage. In March 1791 another Academy commission indicated 
that choice of the means of defining the unit of length could be made from among 
the following three natural bases: 

(i) the length of a pendulum beating seconds;
(ii) a quadrant of the terrestrial equator; and
(iii) a quadrant of the meridian between the terrestrial pole and
equator.

The commission rejected (i) and (ii): (i) because it involved the concept of
time and the phenomenon of gravitational acceleration in its determination and
(ii) because uniformity of the equator was not so well established as for the
meridians, and equatorial regions were less accessible. So the choice fell to the 
quadrant of a meridian, of which the ten-millionth part (later denominated the 
metre) was to become the basic unit of measurement. As complete measure- 
ment of a meridian was not possible an arc was selected of suitable length between 
Dunkirk and Barcelona which extended north and south of latitude 45° (latitude 
of mid-point 46° 11’ 58”) over about 9½° of latitude. Part of this (Paris)


† The sources mostly used for information on the creation and early history of the metre
were references (1) and (2).




meridian had been measured twice previously. Among the prescribed operations 
were : (i) determination of the difference in latitude between Dunkirk and 
Barcelona; (ii) measurement of two geodetic bases; and (iii) measurement of 
the distance between Dunkirk and Barcelona along the meridian by triangulation 
using the bases as reference. 

After receiving sanction for the scheme from the Legislative Assembly on 
30 March 1791, the Academy set up five commissions to supervise various 
aspects of the work, with a central commission (Borda, Condorcet, Lagrange 
and Lavoisier) to direct the whole project. It is recorded that members of all 
the commissions were received by Louis XVI on 19 June 1791, the very eve of 
the flight from Varennes—an omen of the troublous times to come during which 
this remarkable scientific project was nevertheless brought to a successful 
conclusion. 


Fig. 1. One end of the Toise de Pérou, an end-standard made of an iron bar (section
4 × 1 cm approximately) 6 Paris feet (1.949 m) in length between the inner end-faces
exposed by the cut-away portion at each end. The small indentation visible just
below the working face was used (with the corresponding indentation at the other 
end) to set trammels to the length of the Toise. The scale engraved near the lower 
edge is not mentioned in the early descriptions and was probably added at a later 
date. (Photograph reproduced by permission of the Director of the Bureau Inter- 
national des Poids et Mesures.) 


In 1793, impatience with the slow progress of the work and the urgent need 
of a standard led to the National Convention (which had by then replaced the 
Legislative Assembly) adopting a provisional metre equal to 443.44 lignes of the
Toise de Pérou†, a value derived by La Caille from earlier measurements of the
Paris meridian. On 7 April 1795 the basic law adopting the metric system of
weights and measures was passed by the Convention. The system thus founded,
though based on the terrestrially derived metre, was to be maintained by means 
of a prototype standard made of platinum and deposited in the Archives of the 
Republic. 

† This iron bar (fig. 1), also known as the Toise de l’Académie and the first standard
of any precision available in France, was made by La Condamine in 1735 for his expedition
with Bouguer to Quito to measure the length of a degree of the meridian at the equator. Its
length of 6 Paris feet was reputedly equal to that of a mural standard in the Châtelet in
Paris. 12 lignes = 1 inch, 12 inches = 1 foot (Paris measure).






3. MEASUREMENT OF THE ARC OF THE MERIDIAN 


Meanwhile, in 1792, Delambre and Méchain commenced measurements of 
the meridian as soon as their instruments were ready. For the angular measure- 
ments, both in azimuth for geodetic triangulation and in the vertical plane for 
astronomical determinations of latitude, use was made of Borda’s ‘ repeating ’ 
circle (cercle répétiteur, fig. 2). With this each angular setting of the observing 
telescope was related to the divided circle (4000 intervals) by vernier readings 
obtained at four symmetrically disposed positions around the circle. By taking 
the mean of these four readings as the angular setting, errors in circular division 
and in concentricity of the rotational axis of the telescope with respect to the 
circle were minimized. 


Fig. 2. Borda’s ‘repeating’ circle used by Delambre and Méchain for precise geodetic
and astronomical measurements of angle. (Reproduced from Base du Système
Métrique Décimal, Tome 2, Pl. VII.)




In the triangulation process of measurement a base of known length, forming 
one side of a particular triangle in the chain, is required. Two bases were used,
one at Melun, near Paris (see fig. 3) and the other near Perpignan, some 330 000 
toises (640 km) southwards along the meridian. These bases were measured 
by means of four double-toise bimetallic rules (platinum and copper) designed by 
Borda to determine the effects of temperature on the length. 


Fig. 3. Diagram of part of the network of triangles (including the Melun base) used by
Delambre and Méchain in measuring the arc of the meridian between Dunkirk and
Barcelona. (Copied from Base du Système Métrique Décimal, Tome 3, Pl. III.)


Owing to the disorganized condition of France at this time the field work 
was frequently interrupted and exceedingly difficult to accomplish. However, 
after surmounting a multitude of obstacles to the progress of their survey, 




Delambre and Méchain completed the observational work on the meridian in 
1798. A special commission† was set up by L’Institut National (which had
replaced the former Academy, suppressed in 1793) to review the work done and
report on the results obtained. The final calculations gave the following result,
which depends on the figure adopted for the polar flattening f of the Earth
(f = (a − b)/a, where a = equatorial radius and b = polar radius):


Quadrant of meridian (10⁷ metres) = 5 130 740 toises in terms of the Toise
de Pérou at 13°R (16.25°C); 1 metre = 0.513 074 0 toise‡ = 443.296 lignes.


The value which had been provisionally adopted earlier for the metre was
443.44 lignes.
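
The conversions in this result can be checked with a few lines of arithmetic. The Python sketch below assumes only the relations stated in the article (a toise of 6 Paris feet, 12 inches to the foot and 12 lignes to the inch, and a quadrant of 10⁷ metres); the variable names are illustrative.

```python
# Paris measure, as given in the text: 1 toise = 6 feet, 1 foot = 12 inches,
# 1 inch = 12 lignes.
LIGNES_PER_TOISE = 6 * 12 * 12  # 864

# Result of the meridian survey quoted in the text.
QUADRANT_TOISES = 5_130_740
METRES_PER_QUADRANT = 10_000_000

metre_in_toises = QUADRANT_TOISES / METRES_PER_QUADRANT
metre_in_lignes = metre_in_toises * LIGNES_PER_TOISE
toise_in_metres = 1.0 / metre_in_toises

print(f"1 metre = {metre_in_toises:.7f} toise = {metre_in_lignes:.3f} lignes")
print(f"1 toise = {toise_in_metres:.7f} metre")
```

The computed values reproduce the 443.296 lignes adopted for the metre and the reciprocal figure of 1.949 036 6 metre for the toise given in the footnote.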

The platinum end-standard, cross-section 25.3 × 4 mm, was constructed and
adjusted by Janetti so that its length at 0°C was equal, as near as possible, to
the unit deriving from the measured terrestrial meridian and on 22 June 1799 it
was deposited, in full diplomatic and legislative ceremony, with a platinum
kilogramme in the National Archives, where the two prototypes were to be
“conservés avec un soin religieux”.


4. ACCURACY OF THE GEODETIC MEASUREMENTS 


In the detailed account later prepared by Delambre of his and Méchain’s 
measurement of the arc of the meridian, Delambre concludes that both the value 
of the metre in terms of the terrestrial quadrant and the polar flattening term 
adopted by the special commission were too small. His final values are compared 
below with those adopted: 


                      Delambre          Adopted
Value of 1 metre      443.328 lignes    443.296 lignes
Flattening (f)        1/309             1/334


In relation to the modern value for the ten-millionth part of a terrestrial quadrant 
the adopted metre is 0.20 mm (or 1 part in 5000) shorter than intended:
Delambre’s value is less short by 0.07 mm and the provisional metre, being
0.32 mm longer than the adopted metre, was approximately 0.1 mm in excess of
the intention. The modern value of f derived from observations on artificial
satellites is 1/298.30. As the measured arc of the meridian comprised only about
one-ninth of the complete quadrant, the result finally derived for the metre 
depended somewhat critically upon the value assigned to the flattening f, which 
was not accurately enough known at that time. 
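
The millimetre differences quoted in this comparison can be reproduced from the values in lignes. The Python sketch below takes the 0.20 mm shortfall of the adopted metre as given in the article and derives the other figures; the names are illustrative and the calculation is a consistency check rather than an independent determination.

```python
# Values of the metre in lignes, as quoted in the text.
ADOPTED_LIGNES = 443.296
DELAMBRE_LIGNES = 443.328
PROVISIONAL_LIGNES = 443.44

# Shortfall of the adopted metre against the modern quadrant value (from the text).
ADOPTED_SHORTFALL_MM = 0.20

# Length of one ligne in millimetres, since the adopted metre is 443.296 lignes.
MM_PER_LIGNE = 1000.0 / ADOPTED_LIGNES

def excess_over_adopted_mm(value_lignes: float) -> float:
    """Amount, in mm, by which a candidate metre exceeds the adopted metre."""
    return (value_lignes - ADOPTED_LIGNES) * MM_PER_LIGNE

delambre_gain_mm = excess_over_adopted_mm(DELAMBRE_LIGNES)
provisional_gain_mm = excess_over_adopted_mm(PROVISIONAL_LIGNES)
provisional_excess_mm = provisional_gain_mm - ADOPTED_SHORTFALL_MM

print(f"Delambre's metre is longer than the adopted metre by {delambre_gain_mm:.2f} mm")
print(f"Provisional metre is longer than the adopted metre by {provisional_gain_mm:.2f} mm")
print(f"Provisional metre exceeds the intended length by {provisional_excess_mm:.2f} mm")
```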

An indication of the remarkable accuracy achieved in the triangulation is 
provided by comparing the directly measured length of the Perpignan base with 


† There were seven members, among whom were Delambre, Laplace, Legendre,
Méchain and Van Swinden (who drew up the report).


‡ Or 1 toise = 1.949 036 6 metre. Guillaume(*) reports measurements of the Toise de
Pérou made in 1887 and 1890 at the Bureau International des Poids et Mesures by Benoit,
who found the length to be 1.949 090 metre between the end-faces (see fig. 1); the distance
between the indentations was 1.949 001 metre. One of Borda’s four double-toise bimetallic
rules was also measured in 1894 by Benoit and Guillaume, who found its length to be
3.898 068 metres, a value remarkably close to that of 3.898 073 metres evaluated by Borda.




its length as derived in terms of the Melun base (length 6075.90 toises) through
a chain of 53 triangles connecting the two bases: 
Perpignan base (measured) = 6006.249 toises
Perpignan base (derived)  = 6006.089 toises
The difference is 0.160 toise or barely 3 parts in 10⁵.
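
The closing error of the triangulation can be verified from the two base lengths. The Python sketch below computes the discrepancy and its relative size; the conversion to metres uses the toise value given in the article's footnote, and the names are illustrative.

```python
# Lengths of the Perpignan base in toises, as quoted in the text.
PERPIGNAN_MEASURED = 6006.249  # measured directly on the ground
PERPIGNAN_DERIVED = 6006.089   # carried from the Melun base through 53 triangles

# Length of the toise in metres, from the article's footnote.
METRES_PER_TOISE = 1.9490366

discrepancy_toises = PERPIGNAN_MEASURED - PERPIGNAN_DERIVED
relative_discrepancy = discrepancy_toises / PERPIGNAN_MEASURED
discrepancy_metres = discrepancy_toises * METRES_PER_TOISE

print(f"Discrepancy: {discrepancy_toises:.3f} toise ({discrepancy_metres:.2f} m)")
print(f"Relative discrepancy: {relative_discrepancy:.2e}")
```

A discrepancy of about 0.3 m on a base of nearly 12 km, after a chain some 640 km long, is the basis of the article's remark on the accuracy of the survey.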


5. PROGRESS OF THE DECIMAL METRIC SYSTEM 


At first there were many difficulties in bringing the metric system, especially 
its decimal scale of multiples and submultiples, into common use in France and 
concessions were made in 1812 permitting the old measures to be employed for 
ordinary purposes of trade, e.g., a toise of 2 metres, a pied of 1/3 metre, a pouce of 
1/36 metre and a ligne of 1/1296 metre. These concessions were rescinded in 
1837 by an act which decreed that, from 1 January 1840, all weights and measures 
other than those defined by the decimal metric system established by the act 
of 1795 (and a subsequent act of 1800) were proscribed. 

After this rebirth of the decimal metric system in all its pristine coherence, 
the French government made every effort to disseminate knowledge of the 
system in other countries. For this purpose advantage was taken of the oppor- 
tunities provided by the great international exhibitions of the middle 19th 
century. The London (Crystal Palace) Exhibition of 1851 demonstrated in 
most dramatic fashion the great diversity of systems of weights and measures 
in customary use throughout the world and the emerging need for a coherent, 
unified international system. Progress towards the ideal in mind was accelerated 
by the Paris Exhibition of 1867 and by 1872 seven other European states and a 
good part of South America had adopted the decimal metric system. Great 
Britain legalized its optional use in 1864. 

Between 1867 and 1869 the Academy of Sciences of St. Petersburg, the newly 
created International Association of Geodesy and the Paris Academy of Sciences 
(which had been reinstated) all suggested that new metric standards of length and 
mass should be prepared. The French government accordingly called an inter-
national conference (Commission Internationale du Mètre) which assembled in
Paris in 1870, and again in 1872, to discuss the construction and distribution of 
such standards. Preparatory work was initiated in Paris under the auspices of an 
international committee nominated by the first conference, and in 1874 the French 
government issued invitations to delegates of all interested countries to participate 
in the Paris Conférence Diplomatique du Mètre of 1875. Twenty sovereign
states accepted the invitation and on 20 May 1875 the plenipotentiaries of 17
of them signed the treaty known as the Convention du Mètre. An agreement
was thereby made to establish an international weights and measures organization 
primarily concerned with propagating and perfecting the metric system. To 
this end a central laboratory and office—the Bureau International des Poids et 
Mesures (BIPM)—was created and maintained (as it still is) at common expense, 
with the duties of (1) making all necessary verifications of new international 
prototypes of the metre and kilogramme, (2) taking permanent custody of these 
prototypes and (3) periodically comparing the national copies of these standards 
with the international prototypes or their well-authenticated copies. The 
French government undertook to provide a suitable site for the BIPM, which 
was eventually located at the Pavillon de Breteuil, Sèvres. Under the terms of
the Convention the work of the BIPM (which now includes the fundamental 




metrology of temperature, electricity, photometry and the ionizing radiations) 
is managed by an International Committee of Weights and Measures, meeting 
every two years, which is elected by, and under the authority of, the General 
Conference of Weights and Measures, meeting every six years. The General 
Conference is constituted by delegations from the Member-States of the Con-
vention du Mètre, now numbering 38.


6. THE INTERNATIONAL PROTOTYPE METRE


As a result of the preparatory work during the period 1867-75 it was decided
to make the length of the new standard equal to that of the Mètre des Archives
“dans l’état où il se trouve”† and to construct it of an alloy of 90 per cent platinum
with 10 per cent iridium which had been studied by Fizeau, for its metrological
qualities, and by the chemist Sainte-Claire Deville. Instead of making the new
standard of the end-type, like the Mètre des Archives, it was considered preferable
to produce it in the form of a bar, of special X-section (fig. 4) as proposed by
Tresca and designed to give maximum rigidity for minimum use of material,
bearing on its exposed neutral plane two parallel defining lines separated at
0°C by the distance equal to the unit metre length as represented on the Mètre
des Archives. The neutral plane of the bar is the most suitable because the
distance between the lines engraved thereon is least influenced by the manner
of supporting the bar, which is nevertheless specified‡.

Many difficulties were encountered in the metallurgy of the specified alloy. 
Since it had been agreed to make a number of copies for distribution as national 
standards to other countries, it was considered desirable to make them all from 
a single ingot, which involved the casting at one time of 250 kg, an unprecedented 
quantity of a most refractory alloy. All the platinum and part of the iridium 
was purchased in London from George Matthey, who assisted Deville in super- 
vising the casting of the alloy at the Conservatoire des Arts et Métiers on 13 May 
1874. () 

It was subsequently found that the quality of the alloy fell somewhat below 
the exacting specification demanded by the International Committee of Weights 
and Measures; in particular its mean density of 21.1 g/cm³ was well below the
expected range of 21.385–21.455 g/cm³ and indicated the presence of impurities.
Although some experimental metres and kilogrammes were constructed from 
the so-called Conservatoire alloy of 1874, the French government, after much 


† Similarly, the new standard of mass was to be made equal to the Kilogramme des
Archives “dans son état actuel”.


‡ In a recent study of the flexure of length standards Volet() concluded that the inter-
national type of metre line-standard (102 cm long) should be supported symmetrically
at two positions 559 mm apart, but that the length of a metre line-standard, ruled precisely
in the neutral plane, varies by less than 0.01 µm for any separation in the range from 48 to
63 cm. Bessel’s calculations during 1835-38 gave the separation as 571 mm (or 0.5594 of
the length of the bar) for the condition when the shortening in length due to flexure is a
minimum. Airy, on the other hand, adopting the condition in which the end-portions of
the bar are horizontal (or have zero slope) derived a formula which indicated that for two
supporting positions their separation is 0.5774 (or 1/√3) of the length of the bar. The
‘Airy’ positions are more commonly employed with the end-standard because they endue
the bar with the property that the end-faces remain parallel whether the bar is supported
horizontally at the ‘Airy’ positions, or vertically, either on one of the ends or freely
suspended in a suitable manner.




Fig. 4. Section designed by Tresca for the former International Prototype Metre and 
the national copies (dimensions in mm). The neutral plane coincides with the upper 
edge of the bridge of the section. 


hesitation due partly to the conflicting advice received, eventually decided in 
1882 to order a new supply of the alloy from Johnson, Matthey and Company for 
the construction of 30 metre bars and 40 kilogramme cylinders. The new 
material met the specification satisfactorily, i.e., minimum density 21.5 g/cm³,
platinum 90.25–89.75 per cent, iridium 9.75–10.25 per cent, with small
tolerances for impurities such as ruthenium, rhodium, palladium, copper, silver, 
gold and other elements. The International Prototype Metre was made of the 
Johnson—Matthey alloy and some copies were constructed from the Conservatoire 
alloy, but most countries expressed a preference for standards made of the 
Johnson—Matthey alloy. 

In 1889, when the 1st General Conference of Weights and Measures 
assembled in Paris, the new standards were ready for distribution. A particular 
bar (no. 6) was denominated as the International Prototype Metre, its defined 
length being declared to be equal, within the limits of experimental error (say 
about 1 µm), to the superseded Mètre des Archives. The certified lengths of
the copies were given in terms of the metre represented on the Prototype. 
Distribution of the copies to the various Member-States of the weights and 
measures organization, which Great Britain joined in 1884, was made by drawing 
lots. 

The definition of the metre adopted by the 1st General Conference was
eventually refined in some detail to the following form, which was adopted by 
the 7th General Conference in 1927‘ and remained in force until the Prototype 
was superseded by the wavelength definition adopted by the 11th General
Conference in 1960:

L’unité de longueur est le Mètre, défini par la distance, à 0°, des axes des
deux traits médians tracés sur la barre de platine iridié déposée au Bureau inter-
national des Poids et Mesures, et déclarée Prototype du Mètre par la Première
Conférence Générale des Poids et Mesures, cette règle étant soumise à la pression
atmosphérique normale et supportée par deux rouleaux d’au moins un centimètre de
diamètre, situés symétriquement dans un même plan horizontal et à la distance de
571 mm l’un de l’autre.




It will have been noticed that both the Mètre des Archives and the Inter-
national Prototype Metre were defined to represent the unit metre distance when
the bars were at the naturally fixed temperature of melting ice. By using this
fixed thermometric point at 0°C, any uncertainties in the length of the standard
due to errors in measuring its temperature by thermometers were eliminated from
the definition. In practice, however, it is not always convenient to make
comparisons at 0°C, even of primary standards in the laboratory, while for every-
day use lengths must be standardized at normal working temperatures. It 
follows that a knowledge of the coefficients of thermal expansion of the bars is 
required and for this reason much effort has been put into determining the thermal 
dilatations of metals used for standards of length, and particularly of the platinum- 
iridium metre bars. 
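
The size of the correction involved can be illustrated from the expansion of 0.173 mm between 0°C and 20°C that the caption of fig. 5 quotes for a platinum-iridium metre bar. The Python sketch below derives a mean coefficient from that figure and applies it linearly; the linear model and all names are simplifying assumptions made for illustration, not the dilatation formula used by the BIPM.

```python
# Expansion of a platinum-iridium metre bar from 0 C to 20 C, quoted in the
# caption of fig. 5 of the article.
EXPANSION_0_TO_20_MM = 0.173
BAR_LENGTH_M = 1.0
TEMPERATURE_RISE_C = 20.0

# Mean linear expansion coefficient per degree C implied by that figure.
MEAN_ALPHA = (EXPANSION_0_TO_20_MM / 1000.0) / (BAR_LENGTH_M * TEMPERATURE_RISE_C)

def length_at_temperature(temperature_c: float, length_at_zero_m: float = 1.0) -> float:
    """Length in metres at a given temperature, assuming purely linear expansion."""
    return length_at_zero_m * (1.0 + MEAN_ALPHA * temperature_c)

def length_error_um(temperature_error_c: float, length_m: float = 1.0) -> float:
    """Length error in micrometres caused by a given error in temperature."""
    return MEAN_ALPHA * length_m * temperature_error_c * 1e6

print(f"Mean coefficient: {MEAN_ALPHA:.3e} per degree C")
print(f"Length at 20 C: {length_at_temperature(20.0):.6f} m")
print(f"Error from 0.01 C uncertainty: {length_error_um(0.01):.3f} micrometres")
```

On this simple model an error of 0.01°C in the temperature of the bar corresponds to nearly 0.1 µm in a metre, which is comparable with the precision of the best comparisons discussed in the article.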

It is interesting to examine the degree of precision attained in comparisons 
of the metre bars and the constancy with which the metre has been maintained 
since 1889. A list was published by Guillaume) showing the identifying 
numbers of 29 metre bars, the countries to which they were originally issued 
(some have changed ownership since first issue), their values during 1888-89 in 
terms of the Prototype and, where available, their values during 1920-22 and 
the differences between the values at the two dated periods. The mean difference 
is ±0.21 µm, the mean change is −0.09 µm and the standard error of a single
difference value is 0.28 µm.


Fig. 5. (a) One end of the British copy (no. 16) of the Metre showing a pattern of defining
lines ruled on a polished facet in the neutral plane of the bar.

(b) Microphotograph of the pattern of defining lines seen in (a) with the finer
graticule lines used for visual settings by micrometer in the standard form of
microscope comparator. The distance between the two thicker vertical lines is
0.173 mm and represents the expansion of the bar when raised in temperature from
0°C to 20°C. (Crown Copyright Reserved).


Many of the bars have been recalibrated at the BIPM on several other 
occasions and no. 16, the British copy (fig. 5), has the following history of values 
given by Leclerc‘): 


Date      Difference from 1 m at 0°C (in µm)      Remarks
1889      −0.59
1922      −0.66                                   See footnote †
1933

Standard error of single determination = 0.10 µm


Table 1. Values of the British copy (no. 16) at different dates.


After the calibration of 1956, the defining lines on no. 16 were removed, by 
repolishing the two facets in the neutral plane on which the lines were engraved, 
and then re-ruled so that metre lengths at 0°C and 20°C were delineated (see
fig. 5b). The BIPM certified values (1956) of the two metre lengths newly
defined on metre no. 16 are:

1 m − 0.84 µm at 0°C (0°C line)

1 m − 0.54 µm at 20°C (20°C line)
In 1957 the bar was compared at the NPL with certain highly stable Laboratory 
standards which were known in terms of no. 16 before it was re-ruled. The 
results of these comparisons showed that the values previously assigned to these 
standards agreed to within ±0.1 µm with the values in terms of the re-ruled
no. 16. This meant that the link made at the BIPM between the old and new
values of the national copy was highly precise and that a reproducibility of
0.1 µm is obtained in different laboratories with metre line-standards of very
high quality. 

Several other national copies, including the working standards of the BIPM, 
have also been successfully re-ruled during the past few years, the work being 
done through the agency of the BIPM by the Société Genevoise d’Instruments 
de Physique. Great improvements in the technique of ruling fine lines on 
hard, polished surfaces, in positions toleranced to a fraction of 1 µm, have been
made since the metre bars were issued some 70 years ago. Besides the greater 
precision of reproduction of the metre thus made available through standards 
bearing fine lines of symmetrical profile (in cross-section), there is the advantage 
nowadays of being able to ‘ observe’ such standards under photoelectric micro- 
scopes providing a precision of setting some 10 times better than the visual 
microscope, for which the standard error of a single setting is of the order of
0.3 µm.


† The result for 1922 given above is the value of no. 16 at that date as subsequently
amended by the BIPM because of the adoption of a common value of the thermal expansion 
coefficient for all the bars made of Johnson—Matthey alloy; the 1889 value was not modified. 




Leclerc (loc. cit.) expresses the view of the BIPM that, using a modern 
example of the classical form of microscope comparator with visual observations, 
the difference between two metre-standards provided with modern defining 
lines can be determined with an uncertainty not exceeding 0.1 µm. Such
precision could not possibly be guaranteed for the whole family of metre bars 
as ruled in 1889 because of the general lower quality of the lines engraved at 
that time in relation to modern rulings. Nevertheless, Leclerc considers that 
the International Prototype and its copies have preserved the unit metre distance 
with a precision between 0.1 and 0.2 µm. Sears®) in 1936 made an estimate of
between 0.2 and 0.3 µm. Other evidence coming from the international work
done to establish the optical metre, which is next to be discussed, suggests a
precision of 0.2 µm or 2 parts in 10⁷.


7. LIGHT-WAVES AS STANDARDS OF LENGTH


Although Babinet@® was the first to direct attention to the concept of the 
light-wave as a standard of length, there is little doubt that the same idea had 
occurred to others who were at that time developing and verifying the wave 
theory of light. Later on, Fizeau devised the interferometric dilatometer which 
he used to measure the thermal expansion coefficients of samples of the platinum- 
iridium alloys for the new metre line-standards. In the same month, September 
1889, in which the 1st General Conference of Weights and Measures adopted 
the International Prototype Metre, Michelson and Morley“) published the 
paper ‘ On the feasibility of establishing a light-wave as the ultimate standard 
of length’, in which they described how the now well-known Michelson inter- 
ferometer could be used to determine the number of wavelengths in a metre. 

The ideas of Michelson and Morley were realized by Michelson and Benoit?) 
at the BIPM during 1892-93, when the first direct measurement of the metre 
in terms of wavelengths of the cadmium red line (and incidentally of the green 
and blue lines) was made. In 1905-06, Benoit, Fabry and Pérot(!®) repeated the 
determination with the cadmium red line but using other methods and apparatus 
based on the Fabry—Pérot interferometer (or étalon). The International 
Solar Union (now the International Astronomical Union) adopted@), in 1907, 
the wavelength of the cadmium red line as the reference basis for all spectroscopic 
measurements of wavelengths and defined its value in terms of the international 
angstrom (Å). Using the relationship between the wavelength of this line and the
metre, as originally announced by Benoit et al., namely λ_R = 6438.4696 × 10⁻¹⁰ m
in standard air† (a value agreeing with that of Michelson and Benoit within
the limits of experimental error), the angstrom was defined by assigning the
value 6438.4696 Å to λ_R.

In 1923, the International Committee of Weights and Measures(15) accepted
in principle the possibility of eventual adoption of the metre defined in terms of
wavelengths of light, subject to the formulation of satisfactory conditions for its
practical realization, and urged the national standards laboratories to undertake
investigations for this purpose. Four years later the 7th General Conference(16),
although rejecting a proposal to adopt the cadmium standard as the
reference for the metre, gave formal sanction, as an interim measure, to
determinations of length being based on the value of
$\lambda_R = 6438.4696 \times 10^{-10}$ m as an alternative to use of material
standards of the metre.

† The modern specification of standard air for spectroscopy and metrology is: dry air
at 15°C, under a pressure of 1 013 250 dyn/cm² and containing 0.03 per cent by volume of
carbon dioxide.

Following the International Committee’s recommendation of 1923, the first
result to be announced was that of Watanabe and Imaizumi(17), who used a copy
of the apparatus of Benoit, Fabry and Pérot made by Adam Hilger Ltd.

The determinations so far mentioned made use of purely line-standard
techniques to relate the position of a line or lines engraved on an interferometer
standard (or étalon) to the graduations on a copy of the metre by microscope
observations. With the development of modern types of end-standard
constructed from steel blocks or bars having flat, parallel terminal faces with a
mirror-like finish of optical perfection, it became feasible to apply interferometric
methods, as devised by Michelson and others, directly to the measurement of
this form of standard. Thus the end-standard was used in the two
determinations of the metre (and one of the yard) made by Sears and Barrell(18, 19)
at the NPL during 1933-35, the three made by Kösters and Lampe(19, 20, 21) at the
Physikalisch-Technische Reichsanstalt (now Bundesanstalt) during 1933-37
and the one made by Romanova, Wahrlich, Kartashev and Batarchukova(22) at
the USSR Institute of Metrology, Leningrad, in 1940.

Correlation of the optically measured end-standards with the fundamental 
line-standards was effected by the well-known metrological procedure for trans- 
ferring from line- to end-measure (see, for instance, reference 23). 

Other important changes were introduced into the techniques used for these 
more recent wavelength determinations. With the equipment used at the NPL 
and the Institute of Metrology the end-standard was measured both in air under 
known and controlled conditions and in vacuo; with the PTR apparatus the
end-standard was measured in air and a concurrent measurement made of 
refractive index of the same air over an optical path of 1 metre. Platinum 
resistance thermometry was used to measure the temperature of the end-standard 
by all observers after 1927. 

The mean value for the number of wavelengths in standard air of the cadmium
red line contained in the metre, derived(24) from the nine determinations made
between 1892 and 1940, is 1 553 164.12, and the corresponding value of
$\lambda_R$ is $6438.4696 \times 10^{-10}$ m (range of values: 6438.4682 to
$6438.4713 \times 10^{-10}$ m; maximum deviation from the mean: 2.6 parts in
$10^7$; standard error of a single determination: $0.0011 \times 10^{-10}$ m; and
standard error of the mean: $0.0004 \times 10^{-10}$ m). In a recent
re-examination of the data, Hart and Baird(25), using the internationally adopted
dispersion equation for air(26), derived a value $6440.2490 \times 10^{-10}$ m
in vacuo, corresponding to the value $6438.4695 \times 10^{-10}$ m of
$\lambda_R$ in standard air of refractive index 1.000 276 381. Both assessments
lead to a value of $\lambda_R$ which, by happy chance, is identical, or nearly so,
with that adopted for spectroscopy in 1907 and for metrology in 1927. Moreover,
there is no evidence of any trend with time of the individually measured values of
the metre in wavelengths, thus indicating that the hierarchy of platinum-iridium
line-standards is secularly stable.
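The arithmetic linking these figures can be checked with a short sketch. It is an illustration only, not part of the original determinations: it assumes that the number of waves in a metre is simply the reciprocal of the wavelength, and that the vacuum wavelength is the wavelength in standard air multiplied by the quoted refractive index. The function and constant names are invented for this example.

```python
# Figures quoted in the text; wavelengths are in units of 1e-10 m.
LAMBDA_AIR_1907 = 6438.4696        # cadmium red line in standard air
LAMBDA_AIR_HART_BAIRD = 6438.4695  # value re-derived by Hart and Baird
N_STANDARD_AIR = 1.000276381       # refractive index of standard air for this line


def waves_per_metre(wavelength_e10: float) -> float:
    """Number of wavelengths in one metre, for a wavelength given in 1e-10 m."""
    return 1.0e10 / wavelength_e10


def vacuum_wavelength(air_wavelength: float, refractive_index: float) -> float:
    """Vacuum wavelength = wavelength in air multiplied by the refractive index."""
    return air_wavelength * refractive_index


count = waves_per_metre(LAMBDA_AIR_1907)
lambda_vac = vacuum_wavelength(LAMBDA_AIR_HART_BAIRD, N_STANDARD_AIR)
print(f"waves per metre in standard air: {count:.2f}")
print(f"vacuum wavelength (1e-10 m):     {lambda_vac:.4f}")
```

Under these assumptions the two printed values agree with the quoted 1 553 164.12 waves and the vacuum wavelength 6440.2490 to the precision given in the text.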

The question may well be asked why the General Conference so long
refrained from superseding the material standard of the metre. Although proposals
to adopt a wavelength standard were made from time to time, notably in 1927
as already mentioned, none was wholly acceptable until recently, mainly because
no known optical radiation, emitted from a natural element such as cadmium,
possessed a significantly higher reproducibility than the best material standards.
The great advance leading to adoption of the optical metre was mainly due to
the introduction and development of improved sources of monochromatic
light employing, instead of the natural elements, pure isotopes (mononuclides)
of even atomic charge and mass and, therefore, zero nuclear spin.


8. ISOTOPE SOURCES OF MONOCHROMATIC LIGHT 


It is well known that an atom can exist in a number of different states
corresponding to different configurations of the electrons surrounding the nucleus,
that each state is characterized by a particular energy value, and that changes
from one state to another are accompanied by the quantized radiation or
absorption of an electromagnetic wave having the frequency $\nu$, where:

$$\nu = (E_2 - E_1)/h,$$

$h$ being Planck’s constant and $E_1$ and $E_2$ the energies of the two states. As the
wave travels, in vacuo, with velocity $c$, the corresponding wavelength $\lambda$ is:

$$\lambda = ch/(E_2 - E_1).$$

The quantities $c$ and $h$ are fundamental atomic constants, whereas $E_1$ and $E_2$ are
energies of the atom as a whole.
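As a rough numerical illustration of these two relations, the sketch below converts an energy difference into a frequency and a vacuum wavelength. The constants are modern values rather than those available in 1961, and the 2.0467 eV level spacing is a hypothetical figure chosen only because it falls in the orange part of the spectrum; the function names are likewise invented for the example.

```python
H = 6.62607015e-34    # Planck's constant, J s
C = 2.99792458e8      # velocity of light in vacuo, m/s
EV = 1.602176634e-19  # one electron-volt in joules


def frequency_hz(e_upper_ev: float, e_lower_ev: float) -> float:
    """nu = (E2 - E1) / h, with the level energies given in electron-volts."""
    return (e_upper_ev - e_lower_ev) * EV / H


def wavelength_m(e_upper_ev: float, e_lower_ev: float) -> float:
    """lambda = c h / (E2 - E1), the vacuum wavelength in metres."""
    return C * H / ((e_upper_ev - e_lower_ev) * EV)


# Hypothetical pair of levels 2.0467 eV apart (an assumed, illustrative value).
nu = frequency_hz(2.0467, 0.0)
lam = wavelength_m(2.0467, 0.0)
print(f"frequency  = {nu:.4e} Hz")
print(f"wavelength = {lam * 1e9:.2f} nm")
```

Only the difference between the two energies enters, so the choice of zero for the lower level is arbitrary.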

The wavelengths of the spectral lines emitted by a particular group of the 
same kind of atom are determined by differences in energy level between the 
great many states that are possible. Most of the lines formerly recognized 
for use as wavelength standards in spectroscopy and metrology are emitted by 
elements with mixed isotope content, very often with appreciable abundances 
of isotopes having odd and even atomic masses, e.g. cadmium, mercury and 
krypton. A single isotope with odd atomic mass produces lines with hyperfine 
structure due to interactions between the nuclear and electronic spins. A single 
isotope with even atomic charge and mass, and therefore with zero nuclear 
spin, emits lines free from hyperfine structure and is the best source of an optical 
standard. Lines emitted from elements with two or more isotopes having different 
masses (whether odd or even) are also subject to isotope displacement effects— 
the corresponding radiations from isotopes of different masses are not coincident 
—and display unwanted structure or apparent broadening due to overlapping 
of hyperfine-structure components and isotope displacements. 

Figure 6 is a comparison photograph illustrating the remarkable simplicity of
the green line from mercury-198† in relation to the complexity of the same line
from natural mercury, which is a mixture of five isotopes of even mass with two
of odd mass.

Fig. 6. Comparison photograph of Fabry-Pérot fringes in the green line of mercury-198
isotope (left) and natural mercury (right); the principal fringes in natural mercury
are broadened by overlapping of the five slightly displaced components due to the
five isotopes of even mass and the hyperfine structure visible between the principal
fringes is due to the two isotopes of odd mass. (Crown Copyright Reserved.)

† Mercury-198 is prepared, using a kind of alchemy in reverse, by irradiating pure gold
with thermal neutrons in an atomic reactor; some of the gold is transformed by neutron
absorption into a short-lived radioactive isotope of gold which, after emission of a β-ray,
becomes the stable isotope of mercury with atomic mass 198, separable from the gold by
distillation. Krypton-86, the isotope finally selected as the source of the fundamental
radiation, is separable from natural krypton gas by a process of thermal diffusion.

Besides the line-complexities characteristic of odd-mass isotopes and of
isotope mixtures, there are other broadening effects on spectral lines which are
now better understood and therefore capable of control. Of these the natural
width is small but inevitable, for it is due to the finite widths of the two energy
states concerned in the emission of radiation. The Doppler width, due to random
thermal motions of the emitting atoms, is dependent on the square root of the
ratio of the absolute temperature to the atomic mass, and in most sources is
several times larger than the natural width. These broadenings are symmetrical
and produce no shift in the mean wavelength of the line. Pressure broadening
effects, on the other hand, are usually asymmetrical and lead to line-displacements.
They depend on the temperature, density and nature of the atomic neighbours
of the emitters and can be classified as follows:

(i) collision broadening, due to the presence of foreign atoms (impurities
or carriers) with the emitters;

(ii) resonance broadening, due to the presence of atoms of the same kind
as the emitters; and

(iii) interatomic Stark broadening, due to the emitters being within the
influence of ionic and electronic fields.

The effects of pressure, together with those due to self-absorption, or self- 
reversal, in emission lines, as well as the broadening and complexities due to 
external electric fields (Stark effect) and magnetic fields (Zeeman effect) can be 
minimized if the best modern sources of high-quality monochromatic radiations 
are utilized under carefully controlled conditions of excitation. 
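
The square-root dependence of the Doppler width on the ratio of absolute temperature to atomic mass, mentioned earlier in this section, can be sketched numerically. The expression used below is the standard textbook result for a gas with a Maxwellian velocity distribution, which the article itself does not write out. The temperatures are the cooling-bath figures quoted later in the article for the krypton-86 and mercury-198 lamps, taken here (as an assumption) to be the temperature of the emitting gas, and the mass numbers 86 and 198 stand in for the exact atomic masses.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
C = 2.99792458e8      # velocity of light, m/s
AMU = 1.66053907e-27  # atomic mass unit, kg


def doppler_fwhm_fraction(temperature_k: float, mass_number: float) -> float:
    """Fractional Doppler width (full width at half maximum / wavelength).

    Textbook Maxwellian result: sqrt(8 ln2 k T / (m c^2)); it scales as sqrt(T / M).
    """
    mass_kg = mass_number * AMU
    return math.sqrt(8.0 * math.log(2.0) * K_B * temperature_k / (mass_kg * C ** 2))


# Assumed gas temperatures: 63 K for krypton-86, 273 K for mercury-198.
kr = doppler_fwhm_fraction(63.0, 86)
hg = doppler_fwhm_fraction(273.0, 198)
print(f"krypton-86 at 63 K:   {kr:.3e}")
print(f"mercury-198 at 273 K: {hg:.3e}")
print(f"ratio Hg/Kr:          {hg / kr:.2f}")
```

On these assumptions the lighter krypton atom still gives the narrower line, because the much lower operating temperature outweighs its smaller mass.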


9. THE OPTICAL METRE


At the 9th General Conference (1948) a resolution was adopted(27) recognizing
the possibility of defining the metre eventually in terms of wavelengths of a
radiation emitted from a pure isotope, such as krypton-86, mercury-198 or
cadmium-114. In 1953 an advisory committee appointed by the International
Committee made recommendations(28) which (a) strongly favoured use of the
vacuum wavelength of a line, emitted by a single unperturbed atom of a pure
even-mass isotope at rest with respect to the observer, as the means of redefining
the metre, (b) proposed that the selected radiation be specified solely by the two
spectroscopic terms denoting the atomic states between which the energy
transition takes place, and (c) described how the transfer should be made from
the material to the natural standard. Continuity with the existing definition of
the metre was to be preserved through the intermediary of the relationship, in
standard air, $\lambda_R = 6438.4696 \times 10^{-10}$ m, formerly accepted for
spectroscopy and metrology and verified by the nine determinations of the metre.
Since the new wavelength standard was to be related to the vacuum condition, a
dispersion formula for standard air needed to be specified, and to this end the
committee adopted that evaluated by Edlén(26), leading to the value
$6440.2490 \times 10^{-10}$ m for the vacuum wavelength of the cadmium red line.
The committee was unable at that date to select the most suitable source and
radiation owing to the lack of sufficient comparable experimental data on isotope
radiations.

The 10th General Conference (1954) accepted these recommendations and,
following this, many investigations were initiated on the spectroscopic properties
and vacuum wavelengths (in terms of the vacuum wavelength of the cadmium
red line) of radiations emitted by suitable isotopes. The results of these
investigations, which were carried out at the BIPM and the national laboratories
of Australia, Canada, Germany, Japan, the UK, the US and the USSR, were
discussed at meetings of the advisory committee held in 1957 and the orange
line of krypton-86, in well-defined conditions of emission and observation,
was selected as the basis for an optical definition of the metre.

The adopted value for the number of waves in the metre was the mean
derived from wavelength comparisons with the specified vacuum wavelength
($6440.2490 \times 10^{-10}$ metre) of the cadmium red line made in five laboratories
(range of values 3 parts in $10^8$). A consequence of the method used to evaluate
the metre in wavelengths is that, by definition, the angstrom now equals
$10^{-10}$ m, whereas formerly it merely had this value nominally.

Figure 7 compares fringes, at the same path difference of 40 cm, for the orange
line of krypton-86 and the green line of mercury-198. The fringes were
photographed at the NPL from an Engelhard hot-cathode krypton-86 lamp(30, 31)
at 63°K and a Meggers electrodeless mercury-198 lamp(32) maintained near 0°C
by water cooling. The superiority of the fringes in krypton light is clearly
demonstrated.

Fig. 7. Comparison photographs of Fabry-Pérot fringes in the green line of mercury-198
(left) and the orange line of krypton-86 at the same path difference of 40 cm;
approximate limits of interference (or coherence lengths) are 50 cm for the
mercury-198 line and 80 cm for the krypton-86 line. (Crown Copyright Reserved.)

The definition of the metre unanimously adopted(33) by the 11th General
Conference (1960) is embodied in the following resolution:


“1. The metre is the length equal to 1 650 763.73 wavelengths in vacuum of the
radiation corresponding to the transition between the levels $2p_{10}$ and $5d_5$ of the
krypton-86 atom.

2. The Definition of the Metre in force since 1889, based on the International Prototype
of platinum-iridium, is abrogated.

3. The International Prototype of the Metre sanctioned by the First General Conference
of Weights and Measures of 1889 shall be kept at the Bureau International des Poids et
Mesures under the same conditions as those laid down in 1889.”


The source of the standard radiation recommended in 1960 by the
International Committee is the Engelhard lamp already mentioned (fig. 8), operated
according to the following instructions(33):

“In accordance with paragraph 1 of Resolution 2 adopted by the Eleventh General
Conference of Weights and Measures (October 1960), the International Committee of
Weights and Measures recommends that the radiation of krypton-86 adopted as the
fundamental standard of length be realized by means of a hot-cathode discharge lamp
containing krypton-86 of purity not less than 99 per cent, in sufficient quantity to
ensure the presence of solid krypton at the temperature of 64°K, this lamp being
fitted with a capillary having the following characteristics: internal diameter 2 to 4
millimetres, wall thickness about 1 millimetre.

“It is estimated that the wavelength of the radiation emitted by the positive column
is equal, to within 1 part in 100 million ($10^{-8}$), to the wavelength corresponding to
the transition between the unperturbed levels, when the following conditions are
satisfied:

1. the capillary is observed end-on, so that the light rays used travel from the
cathode side towards the anode side;

2. the lower part of the lamp, including the capillary, is immersed in a cooling
bath maintained at the temperature of the triple point of nitrogen, to within 1 degree;

3. the current density in the capillary is 0.3 ± 0.1 ampere per square centimetre.”


Fig. 8. The Engelhard type of hot-cathode krypton-86 discharge lamp shown removed 
from the cryostat in which it is cooled to the triple point of nitrogen; means are 
provided for controlling and measuring the temperature of the lamp as well as other 
conditions of excitation. (Crown Copyright Reserved.) 


In a paper ‘On the relation between the old and the new definition of the
international metre’, Hart and Baird (loc. cit.) have reported measurements in
terms of the new definition, made in the laboratories of the National Research
Council, Ottawa, on four 1-metre line-standards which were well known in
terms of the former International Prototype Metre. Their results are quoted
in Table 2.

The results in Table 2 indicate with fair certainty that the optical metre is
shorter than the former International Prototype Metre by 0.2 μm, an amount
which is no greater than the uncertainty associated with determinations of length
in terms of the former Metre or its well-authenticated copies.


            Length of standard      Number of wavelengths
Standard    at 20°C                 in M derived from        O − M
            in terms of M           the comparisons

M16         1 m −   0.54 μm         1 650 764.16             −0.26 μm
M4          1 m + 167.37 μm         1 650 763.92             −0.11 μm
M169        1 m +   1.15 μm         1 650 764.09             −0.22 μm
M306        1 m + 269.91 μm         1 650 764.13             −0.24 μm

Mean value                          1 650 764.08             −0.21 μm


Table 2. Results of comparisons of four 1-metre line-standards with the krypton-86
standard ($\lambda_{vac}$).
O = 1 650 763.73 $\lambda_{vac}$.
M = 1 metre defined by the former International Prototype Metre.


Notes: M16 is the British copy of the Metre, no. 16, made of platinum-iridium;
M4 is a platinum-iridium metre from the US National Bureau of Standards;
M169 is an invar metre, re-ruled in 1952, from the National Research Council,
Ottawa;
M306 is a nickel metre, the former Canadian legal copy.
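
The last column of Table 2 follows from the third by simple arithmetic, which the sketch below reproduces. It assumes that O − M is the difference between the defined count (1 650 763.73) and the count found for the Prototype metre, multiplied by the krypton-86 vacuum wavelength; the variable names are invented for the illustration and the computed offsets are compared only loosely with the rounded tabulated values.

```python
DEFINED_COUNT = 1650763.73             # wavelengths in the optical metre, O
WAVELENGTH_UM = 1.0e6 / DEFINED_COUNT  # krypton-86 vacuum wavelength, micrometres

# Number of wavelengths found in M (the Prototype metre) via each standard.
measured_counts = {
    "M16": 1650764.16,
    "M4": 1650763.92,
    "M169": 1650764.09,
    "M306": 1650764.13,
}


def optical_minus_prototype_um(count_in_m: float) -> float:
    """O - M in micrometres: a larger count in M means M is the longer metre."""
    return (DEFINED_COUNT - count_in_m) * WAVELENGTH_UM


offsets = {name: optical_minus_prototype_um(n) for name, n in measured_counts.items()}
mean_offset = sum(offsets.values()) / len(offsets)

for name, value in offsets.items():
    print(f"{name:5s} O - M = {value:+.3f} um")
print(f"mean  O - M = {mean_offset:+.3f} um")
```

Each computed offset agrees with the tabulated figure to within about 0.01 μm, the residual differences being attributable to rounding of the published counts.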


Work is proceeding in the national laboratories with the object of refining
still further the specification for the krypton-86 discharge lamp, using greatly
improved interferometric techniques of wavelength comparison which are now
capable of yielding a precision of 1 or 2 parts in $10^9$. An extremely interesting
alternative type of source now under study(34, 35, 36) for the production of highly
monochromatic light is the atomic beam used either in emission or absorption.


ACKNOWLEDGMENTS 


I am indebted to my colleague Dr. A. H. Cook who provided information 
and material on the early geodetic measurements of the arc of the meridian 
obtained by him for an exhibition on The Metre held at the NPL during May 
1961. Acknowledgments are also made to the Director of the Bureau Inter- 
national des Poids et Mesures and to the Director of the National Physical 
Laboratory for permission to reproduce material used for some of the figures. 


REFERENCES 


(1) Base du Système Métrique Décimal: Mesure de l’Arc du Méridien par MM. Méchain
et Delambre, Tomes 1, 2 et 3, rédigées par M. Delambre; Recueil d’Observations
Géodésiques, Astronomiques et Physiques, Tome 4, rédigée par MM. Biot et
Arago (published in Paris between 1806 and 1821 for L’Institut de France).
(2) Le Système Métrique Décimal, 1930 (Paris: Gauthier-Villars).
(3) Guillaume, C. E., 1927, La Création du Bureau International des Poids et Mesures et
son Oeuvre, p. 128 (Paris: Gauthier-Villars).
(4) Volet, C., 1952, Etude sur la flexion des étalons de longueur. Trav. Bur. int.
Poids Mes., 21, 1.
(5) McDonald, D., 1960, A History of Platinum, p. 207 (London: Johnson, Matthey &
Co. Ltd.).
(6) Trav. Bur. int. Poids Mes., 1930, 18 (C. R. 7e Conf. gén. 1927), 49.
(7) See reference (3), p. 125.
(8) Leclerc, G., 1958, P. V. Com. int. Poids Mes., 26B, M48.
(9) Sears, J. E., 1936, Our basic standards of measurement. Science Progress, p. 220.
(10) Babinet, J., 1829, Ann. chim. phys., 40, 177.
(11) Michelson, A. A. and Morley, E. W., 1889, Amer. J. Sci., 38, 181.
(12) Michelson, A. A. and Benoit, J. R., 1895, Trav. Bur. int. Poids Mes., 11, 3.
(13) Benoit, J. R., Fabry, C. and Pérot, A., 1913, Trav. Bur. int. Poids Mes., 15, 3.
(14) Trans. int. Un. Sol. Res., 1907, 20, 28.
(15) P. V. Com. int. Poids Mes., 1923, 10, 67.
(16) See reference (6), p. 52.
(17) Watanabe, N. and Imaizumi, M., 1928, Proc. imp. Acad. (Tokyo), 4, 350.
(18) Sears, J. E. and Barrell, H., 1934, Philos. Trans. A, 233, 143.
(19) Kösters, W. and Sears, J. E., 1935, P. V. Com. int. Poids Mes., 17, 113.
(20) Kösters, W. and Lampe, P., 1934, Trav. Bur. int. Poids Mes., 19 (C. R. 8e Conf. gén.
1933), 79.
(21) Kösters, W., 1938, Werkstattstechnik, 32, 527.
(22) Romanova, M. F., Wahrlich, G. V., Kartashev, A. I. and Batarchukova, N. R.,
1942, C. R. Acad. Sci. URSS, 37, 46.
(23) Ann. Rep. Nat. Phys. Lab. Tedd., 1919, p. 90.
(24) Barrell, H., 1950, Chambers’s Encyclopaedia, 9, 188.
(25) Hart, K. H. and Baird, K. M., 1961, Can. J. Phys., 39, 781.
(26) Edlén, B., 1953, J. Opt. Soc. Amer., 43, 339.
(27) Trav. Bur. int. Poids Mes., 1952, 21 (C. R. 9e Conf. gén. 1948), 44.
(28) P. V. Com. int. Poids Mes., 1954, 24, M43.
(29) P. V. Com. int. Poids Mes., 1958, 26B, M30.
(30) Kösters, W. and Engelhard, E., 1950, P. V. Com. int. Poids Mes., 22, 137.
(31) Engelhard, E., 1958, P. V. Com. int. Poids Mes., 26B, Annexes M2-M7.
(32) Meggers, W. F. and Westfall, F. O., 1950, J. Res. Nat. Bur. Stand., 44, 447.
(33) C. R. 11e Conf. gén. Poids Mes. 1960 (Paris: Gauthier-Villars, in the press).
(34) Kessler, K. G., Barger, R. L. and Schweitzer, W. G., 1959, Nat. Phys. Lab. Symp.
no. 11 “Interferometry”, p. 67 (London: HMSO 1960).
(35) Meissner, K. W. and Kaufman, V., 1959, J. Opt. Soc. Amer., 49, 942.
(36) Barger, R. L. and Kessler, K. G., 1961, J. Opt. Soc. Amer., 51, 827.


The Author : 


Studied at the Royal College of Science, South Kensington, and later specialized in 
spectroscopy under Albert Fowler. Entered NPL (Metrology Division) in 1923 and has 
published papers on length interferometry and the metrology of the basic standards of 
measurement. Superintendent of Standards Division and member for the UK on the 
International Committee of Weights and Measures. 


Lasers 


by D. J. E. INGRAM 
Professor of Physics, The University, Keele 


SUMMARY 


This article outlines both the basic ideas and principles of the laser 
(light amplification by stimulated emission of radiation), and summarizes 
briefly the experimental results that have been obtained so far. The develop- 
ment of the laser has followed very closely on that of the maser, which pro- 
duces amplification in the microwave region, and which has already been 
discussed in Contemporary Physics. The practical possibilities and applica- 
tions of lasers are even more promising than those of their microwave counter- 
parts, however, because they not only act as amplifiers with low internal noise, 
but also serve as a source of coherent radiation in the visible region, where none 
has hitherto existed. The implications that this has for communication and 
similar purposes are also discussed. 

The article deals with the general ideas of coherent and incoherent
radiation; stimulated and spontaneous emission; and inversion of energy
level populations before applying these in detail to the visible region. Two
methods that have been successfully employed to invert energy level
populations and produce coherent visible radiation are then described, and the
likely direction of future work is considered.


1. INTRODUCTION 


Most physicists are now aware that their vocabulary has increased during
the last year or two by the appearance of a group of names which include such
terms as ‘maser’, ‘laser’ and ‘iraser’, and inasmuch as the last three letters
appear to be common to all these new names, most physicists are also vaguely
aware that these new devices have something to do with stimulated emission of
radiation. The very great importance of these new devices does, in fact, rely on
this process of stimulated emission to produce coherent radiation, but it would 
probably be fair to say that the practical importance of the devices comes from 
their ability to produce coherent radiation, irrespective of the actual mechanism 
whereby this is obtained. It is necessary to understand exactly what is meant by 
coherent and incoherent radiation before the particular applications of these new 
devices can be discussed. It will be seen in the next section, that one of the 
crucial factors which determine whether the radiation emitted is to be coherent 
or not is the probability that an incoming photon has of stimulating emission in 
the system compared with the probability of being absorbed by it. It therefore 
follows that the inversion of normal energy level populations is required for the 
former probability to be the greater, and thus for stimulated emission to occur. 

This inversion of the energy-level population was of course obtained for
systems operating in the microwave region before the same idea was applied in
the optical region of the spectrum. Professor Stevens has already written an
article on ‘Masers’ for Contemporary Physics (October 1960), and the main
object of this paper will be to show how these ideas have been extended to
‘lasers’ (i.e. light amplification by stimulated emission of radiation). The
third section of the present article outlines the historical development of the
various processes which have been used to produce such inversion of the energy
level populations, and deduces the general factors which are required for operation
in any region of the spectrum. The actual application of these ideas to the
production of coherent radiation in the visible region is then taken up and the
final section deals with the practical results that have been obtained so far with
these coherent radiation sources, and also considers the fascinating possibilities
that are now opening up in the way of future applications.


2. COHERENT AND INCOHERENT RADIATION 


It will probably be best to approach the whole concept of coherence in
electromagnetic radiation by considering one or two definite examples. To begin with
we might take radiation at one extreme of the electromagnetic spectrum, such as
normal broadcast radio waves. We can represent the transmission of such radio
waves into space by the simple diagram shown in fig. 1. The electric and
magnetic fields, which form the electromagnetic radiation as it is propagated out
from the aerial, are themselves produced by the flow of electrons up and down the
aerial wire. If we take the particular case of a dipole fed at its centre, as shown in
fig. 1, we know that the two ends must be nodes for current flow and antinodes
for voltage distribution. We can therefore visualize pulses of current flow moving
up and down the aerial itself, and in so doing giving rise to changing electric and
magnetic fields of the same frequency. For a continuous uninterrupted sinusoidal
wave train to be produced it will of course be necessary that the current flow
itself should also be of a similar regular periodic nature. It is only because
the movement of the electrons is correlated in this way to produce a current
pulse which moves steadily up and down that a coherent wave train is obtained.
If the individual electrons were moving up and down the wire with random phase,
the emitted radiation would be a collection of individual wave trains having no
correlation in phase, i.e. no coherence.

Fig. 1. Radio frequency emission from two dipole aerials. (a) Coherent wave trains
with related phase. (b) Incoherent wave trains with unrelated phase.

We might now visualize what will happen if a second aerial is placed close 
to the first. This can again be considered as driven by a sinusoidal oscillator and 
hence it will also emit a regular sinusoidal wave train out into space. If the 
oscillators driving the two aerials are not connected or related to one another in 
any way, we will not be able to predict what particular effect the combination of 
two wave trains will have. At one time the wave trains may be in phase and
reinforce each other, whereas at some other time they may destructively interfere. 
This type of emission in which the wave train from a given oscillator bears no 
definite relation to the phase of the wave train from another oscillator is again an 
example of incoherent radiation. If a large number of such aerials or oscillating 
systems is considered, we can make an estimate of the average amplitude and 
energy of the net radiation at any point by just assuming a random distribution 
between their phases. On the other hand it would be far more satisfactory if we 
could arrange that all the aerials and oscillating systems emitted waves which 
were in step with one another or, in other words, had phase coherence amongst 
themselves. 
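
The contrast between a random distribution of phases and phase coherence can be illustrated with a small numerical sketch. It is a simplified model, not taken from the article: each aerial is represented by a unit-amplitude phasor, the resultant intensity is the squared magnitude of the phasor sum, and the incoherent case is averaged over many random draws of the phases. The function names, the number of aerials and the seed are arbitrary choices for the illustration.

```python
import cmath
import math
import random


def resultant_intensity(phases):
    """Squared magnitude of the sum of unit-amplitude waves with the given phases."""
    total = sum(cmath.exp(1j * p) for p in phases)
    return abs(total) ** 2


def mean_incoherent_intensity(n_sources, trials, rng):
    """Average resultant intensity when every source has an independent random phase."""
    acc = 0.0
    for _ in range(trials):
        phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(n_sources)]
        acc += resultant_intensity(phases)
    return acc / trials


rng = random.Random(0)  # fixed seed so that the run is repeatable
n = 100                 # number of aerials (arbitrary)
coherent = resultant_intensity([0.0] * n)
incoherent = mean_incoherent_intensity(n, 2000, rng)
print(f"coherent intensity:        {coherent:.1f}")
print(f"mean incoherent intensity: {incoherent:.1f}")
```

In this model the coherent intensity grows as the square of the number of sources, while the average incoherent intensity grows only in proportion to their number.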

This phase coherence can be quite readily obtained for radio waves; all that 
is necessary is to connect a synchronizing link between the different oscillators so 
that either all the oscillators produce their maximum voltage swings at the same 
time, or, if the aerials are spaced apart by any finite amount, the time lag in the 
voltage swing corresponds to the path difference between the aerials. It is by 
using such multiaerial systems, all driven by the same master oscillator, that large 
aerial arrays can be constructed with very precise directional properties. It can 
be seen, therefore, that the crucial difference between incoherent and coherent 
radiation so far as the radio waves are concerned, is whether the different aerial 
and oscillator systems are emitting wave trains with a definite phase relation 
between them or not. The importance of coherent radiation in radio communi- 
cations can be very readily illustrated. In fig. 2 the wave form of a frequency 
modulated signal is considered under various circumstances. Figure 2 (a) 
shows the unmodulated carrier wave as it would be produced by a single oscillator 
or aerial. In fig. 2 (b) the wave form is given for the same carrier wave but now 
frequency modulated by an incoming signal. We can now imagine this frequency 
modulated wave being transmitted by four aerials. In the first case we consider 
the four aerials as linked to the same master oscillator and hence emitting waves 
which are correlated in phase. The total wave train at any point a long distance 
away from these aerials will therefore be of a form shown in fig. 2 (c), i.e. the
effects will have added in phase. On the other hand, if these four aerials are 
emitting wave trains with no correlation between their phases we are likely to 
obtain the superposition as represented in fig. 2 (d). It will be seen that all the 
information previously contained in the frequency modulation has now been 
completely lost. This is one of the major disadvantages of incoherent radiation in 
that it is quite impossible to modulate it with information in any precise or 
quantitative manner. It is true that a very gross form of amplitude modulation 
can be effected by altering the overall intensity of the incoherent beam, but it is 
then not possible to transmit a large number of such amplitude modulated signals, 
as is possible on a carrier frequency of coherent radiation. 




At this point it might be relevant to consider the quantitative figures as they 
affect the question of transmission of information. In order to transmit all the 
information that is required to build up a television picture, a bandwidth 
covering 4 Mc/s is necessary. Since the carrier frequency must be an order of 


DRL 


AW AVAVAVAVAVAVAVAUAA CANCE 


\yuvw yyy © 


Fig. 2. Wave form of frequency modulated carrier. (a) Unmodulated carrier. (b) Fre- 
quency modulation on single carrier pulse. (c) Superimposed output from four 
such wave trains with coherent phase. (d) Superimposed output from four wave 
trains with random phase relation. 


magnitude larger than the modulation frequency, it follows that television 
programmes are normally broadcast on frequencies of 40 Mc/s or above, and that 
in a frequency range between 40 and 52 Mc/s it would be possible to include 
three different television programmes. If a coherent radiation is available then 
all the information represented by these different television bands can be 
transmitted at the same time and still selected at the receiving end. If we now 
move up in frequency to the microwave region where coherent radiation of such 
frequencies as 4,000-5,000 Mc/s can be produced, it follows that 100 times the
amount of information can be transmitted in a coherent beam covering this 
frequency range, compared with the television frequency range from 40 to 
50 Mc/s. If we now consider radiation in the visible region, it follows that there 
would be a bandwidth of 3 × 10^8 Mc/s between radiation of a wavelength of
4000 Å and that of wavelength 7000 Å. It would, therefore, be possible to
transmit simultaneously all the information required for over a million television 
programmes if a light beam of coherent radiation were available. The enormous 
information capacity of such coherent beams of radiation at this frequency is 
one of their main applications. 
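The arithmetic behind these capacity figures can be checked in a few lines. This is a rough sketch only: it takes the 4 Mc/s television bandwidth and the band limits quoted above, assumes that channels can be packed side by side with no guard bands, and uses a function name (`band_mc_per_s`) that is purely illustrative.

```python
C = 2.998e8  # speed of light in m/s


def band_mc_per_s(short_wavelength_angstrom, long_wavelength_angstrom):
    """Frequency span in Mc/s between two wavelengths given in angstrom units."""
    f_high = C / (short_wavelength_angstrom * 1e-10)
    f_low = C / (long_wavelength_angstrom * 1e-10)
    return (f_high - f_low) / 1e6


TV_CHANNEL = 4.0  # Mc/s needed for one television programme, as quoted in the text

vhf_channels = (52.0 - 40.0) / TV_CHANNEL             # programmes between 40 and 52 Mc/s
microwave_ratio = (5000.0 - 4000.0) / (50.0 - 40.0)   # microwave band against 40 to 50 Mc/s
visible_band = band_mc_per_s(4000.0, 7000.0)          # width of the visible region in Mc/s
visible_channels = visible_band / TV_CHANNEL

print(f"programmes between 40 and 52 Mc/s: {vhf_channels:.0f}")
print(f"microwave capacity relative to 40-50 Mc/s: {microwave_ratio:.0f}")
print(f"visible bandwidth: {visible_band:.2e} Mc/s")
print(f"television programmes in the visible band: {visible_channels:.1e}")
```

On these assumptions the visible band is a little over 3 × 10^8 Mc/s wide, so the figure of "over a million" programmes is a conservative one.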

All the normal sources of radiation in the visible region of the spectrum 
produce incoherent radiation and are therefore not suitable for any type of 


Lasers 439 


precise modulation or detailed transmission of information. The mechanism 
whereby visible radiation is emitted can be briefly summarized as follows. The 
emission or absorption of visible radiation corresponds to a transition in atomic 
energy levels involving the outer electronic structure. It is thus possible to 
represent the process as in fig. 3 (a) where the bottom energy level corresponds 
to the ground state of the atom concerned and the top level is the excited state, 
and a transition from the top to the bottom level produces emission of radiation 
in the visible region. In order to produce such emission it is therefore necessary 
to excite the atom to the higher level by some means, and this energy can be 
supplied in a variety of ways, such as by impact with fast moving electrons as in a
gas discharge tube or neon lamp. 




Fig. 3. Emission and absorption processes between two energy levels. (a) Spontaneous 
emission. (b) Stimulated emission. (c) Absorption. 


The lifetime of the atoms in the excited state is usually very small, although 
for some metastable states this may be of the order of one hundredth of a second. 
Even in the absence of any other radiation, however, the atom will always revert 
to the ground state emitting the visible quantum as it does so. This emission of
radiation produced when the atom returns to the ground state without being
affected by any other radiation in the process is termed ‘spontaneous emission’.
It may be considered as very similar to radioactive emission in that one cannot 
predict the actual time or phase of the emission of any one atom, although a 
definite half-life can be given for the average over the atoms in the upper level. 
So the wave trains corresponding to these emitted quanta of radiation will be 
entirely uncorrelated with similar emissions from other atoms in the excited 
level. The situation is, therefore, comparable with the aerials in our previous 
example which were not connected or correlated in any way. Moreover there is 
no correlation over any length of time, that is, even the same atom when next 
excited and re-radiating will not radiate with a phase which is correlated in any 
way to that of the wave train it last emitted. The argument given above shows 
that such radiation in the visible region will be quite useless for any precise 
modulation and hence the information content of such radiation will be extremely 
low. 

If we compare this situation with that already considered for the radio waves, 
it will be obvious that the feature lacking in this visible radiation is the correlating 
link between the different oscillators. If such a link could be arranged so that 
each atom in the excited level were made to radiate its wave train in synchronism
with the phase of others which had undergone the same transition just before,
then a coherent constructively interfering pattern would be produced. 


3. SPONTANEOUS AND STIMULATED EMISSION 


In order to consider the different processes of absorption and emission that 
can occur between atomic energy levels, let us first take the simplest case of two 
such levels as already envisaged in fig. 3. We have already seen that if an atom is 
excited to the upper level by such a process as electron collision, then it will in 
time fall to the bottom level and emit radiation of frequency corresponding to 
the energy quantum as it does so. This emission which does not require any 
effect to initiate it is termed spontaneous emission. If irradiation of this same 
frequency is already present however, we will find that the atoms in the upper 
level are stimulated to emit their energy much more quickly than they would by a 
simple process of spontaneous emission. The presence of the original radiation 
therefore produces what is called stimulated emission which differs from the 
spontaneous emission in two ways. First it is emitted from the excited atoms in 
a much shorter time interval after excitation has taken place, and, secondly, the 
emitted radiation is in phase with the stimulating radiation. The whole process 
of stimulated emission can therefore be represented diagrammatically as in fig. 3 (b)
where the incoming quantum is seen to act on the excited atom and cause it to 
emit radiation of the same frequency and phase. 

As well as these two processes of emission which take place there is also the 
converse process of absorption. It is not possible to have anything corresponding 
to spontaneous absorption of course, since no energy is available in the system
itself to cause the necessary transition. All absorption processes must, therefore, 
take place under the action of incoming radiation and in this sense are stimulated 
processes. This distinction is not necessary now, and the process of absorption 
can be simply represented as in fig. 3 (c) where the incoming quantum is seen to 
raise the atom in the lower level to the higher level exchanging its energy in the 
process. We must now consider what factors determine the amounts of emission 
and absorption that will take place. 

There are two basic parameters which will govern the quantity of radiation 
emitted or absorbed, i.e. the number of atoms in the given energy levels and the 
probability that these have of undergoing either absorption or emission. If the 
system under consideration is one that obeys classical Maxwell-Boltzmann 
statistics then the ratio of the number of atoms in the two energy levels will be 
given by 


$$ n_2/n_1 = \exp[-(E_2 - E_1)/kT] \qquad (1) $$


It therefore follows that there will be somewhat fewer atoms in the higher energy 
level. 
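How much smaller the upper-level population is depends strongly on the size of the energy gap compared with kT. The sketch below evaluates equation (1) for a microwave transition and for an optical one. The temperature of 300 K is an assumption made only for illustration, and the two transitions (4,000 Mc/s and 6943 Å) are simply values quoted elsewhere in this article.

```python
import math

H = 6.626e-34  # Planck constant, J s
K = 1.381e-23  # Boltzmann constant, J/K
C = 2.998e8    # speed of light, m/s


def population_ratio(frequency_hz, temperature_k):
    """Upper-to-lower population ratio from equation (1) for a gap of h * frequency."""
    return math.exp(-H * frequency_hz / (K * temperature_k))


ROOM_TEMPERATURE = 300.0  # kelvin, assumed for illustration

microwave = population_ratio(4.0e9, ROOM_TEMPERATURE)        # 4,000 Mc/s transition
optical = population_ratio(C / 6943e-10, ROOM_TEMPERATURE)   # 6943 angstrom transition

print(f"microwave transition: n2/n1 = {microwave:.5f}")
print(f"optical transition:   n2/n1 = {optical:.1e}")
```

At microwave frequencies the two populations are nearly equal, whereas for an optical transition at room temperature the upper level is almost empty, so that a very large disturbance of the thermal distribution is needed before emission can outweigh absorption.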

It was shown by Einstein that the two processes which are produced by
interaction with the radiation field, i.e. stimulated emission and absorption,
both have the same probability associated with them, thus the probability of
absorption taking place can be represented as

$$ p_a \cdot n_1 \qquad (2) $$

where $p_a$ is the probability that one atom will absorb a quantum of radiation, and
this is multiplied by the number of atoms in the lower level to give the total
probability for the system. In the same way the probability $p_{se}$ of stimulated
emission taking place can be written

$$ p_{se} \cdot n_2 = p_a \cdot n_2 \qquad (3) $$

where the probability for stimulated emission has been put equal to that for
absorption. Since $n_1$ is greater than $n_2$ in thermal equilibrium, it follows that the
overall probability for the system that absorption of radiation will take place is
greater than the probability of stimulated emission, and hence if these were the
only processes which occurred, the equilibrium distribution would be rapidly
changed as more atoms were transferred from $E_1$ to $E_2$ than from $E_2$ to $E_1$.

There are, however, other effects which will cause extra downward transitions 
and thus restore the population distribution towards that corresponding to ther- 
mal equilibrium. In the simplest case of a perfect black-body radiator in equili- 
brium with its radiation field, this extra process will be the spontaneous emission 
we have already considered. The radiation emitted from this energy level system 
will consist partially of stimulated radiation in phase with the incoming quanta, 
and partially of spontaneous radiation which is emitted with a random phase. 
From our previous considerations of coherent radiation, it is evident that the 
stimulated emission has the properties of coherence and if we can arrange that 
our emission is mainly of this type and that this is greater than the absorption, 
then a coherent radiation source will have been produced. 




Fig. 4. Effect of incident quantum on energy level system. (a) Normal energy level 
distribution. (b) Inverted energy level distribution. 


In order to see whether it is possible to produce a net output of stimulated 
emission, let us consider one incident quantum of the correct frequency as it 
interacts with the atoms as arranged in a normal thermal distribution across 
the two energy levels, as shown in fig. 4 (a). Since the coefficients of absorption 
and stimulated emission are equal, it follows that there is more likelihood of this 
incident quantum being absorbed, than producing stimulated emission, and hence 
we cannot expect a net output of coherent radiation from a system which has 
fewer atoms in the excited state than in the ground state. But if we take the 
same energy level system and invert the population distribution as shown 
in fig. 4(b), then there is more likelihood of the incident quantum stimulating 
another quantum in phase with it than there is of it being absorbed. These two
quanta may now go on and interact with other excited atoms to produce further 
quanta all in phase with each other and thus reinforcing as coherent radiation. 
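The argument can be put into a very simple numerical form. The sketch below assumes that each quantum has the same small probability `p` per atom of being absorbed or of stimulating an emission, and that the populations stay fixed while the radiation builds up (depletion of the upper level is ignored). The function name, the value of `p` and the populations are all illustrative.

```python
def photon_number_after(passes, n_upper, n_lower, p=1e-6, start=1.0):
    """Expected number of quanta after a given number of passes.

    Per pass, stimulated emission adds p * n_upper quanta for every quantum
    present and absorption removes p * n_lower, so the number of quanta is
    multiplied by 1 + p * (n_upper - n_lower).
    """
    quanta = start
    for _ in range(passes):
        quanta *= 1.0 + p * (n_upper - n_lower)
    return quanta


# Normal distribution: fewer atoms in the upper level.
normal = photon_number_after(100, n_upper=40_000, n_lower=60_000)

# Inverted distribution: more atoms in the upper level.
inverted = photon_number_after(100, n_upper=60_000, n_lower=40_000)

print(f"normal distribution:   {normal:.3f} quanta per incident quantum")
print(f"inverted distribution: {inverted:.3f} quanta per incident quantum")
```

With the normal distribution the incident radiation dies away, while with the inverted distribution it grows from pass to pass; only the sign of the population difference decides between the two.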




Any source of coherent radiation in the visible region must therefore be one 
in which the radiation is predominantly produced by stimulated emission 
rather than by spontaneous emission. Also for this to be so the normal population 
distribution of the atoms between the energy levels must be inverted so that 
there is a larger number in the excited state than in the ground state. This 
problem of inversion of energy level populations was first considered and 
achieved in the microwave region when the ammonia and solid state masers 
were developed. 


4. INVERSION OF ENERGY LEVEL POPULATIONS 


The first method of achieving the inversion of energy level populations was 
that employed by Townes and his collaborators® in the design and construction 
of the ammonia maser. In this case the inversion of the energy level population 
was achieved by an actual physical separation of the molecules in the two energy


Fig. 5. Method of population inversion for ammonia molecule. (a) Energy levels in
applied electric field (energy plotted against electric field). (b) Movement of
molecules in quadrupolar electric field, showing the motion of a molecule in the
excited state and of a molecule in the ground state.


levels, and those in the higher level were focused through a hole into a cavity 
resonator whilst those in the ground state were eliminated. 

The possibility of carrying out such a physical separation arises from the
Stark effect of the ammonia molecule and the way in which the two energy
levels change on application of an external electric field. This is shown in fig.
5 (a) and it is seen that the excited level rises as the electric field strength increases
whereas the lower level falls. If we imagine ammonia molecules in an electric
field represented by the dotted line xy on the diagram, then those in the excited
state will tend to move towards the region of lower electric field strength whereas
those in the lower state will tend to move towards the regions of higher electric
field strength. So it is only necessary to pass a beam of ammonia molecules through a
region of inhomogeneous electric field in order to produce a literal separation of
the molecules into the excited and ground state groups. If some form of quadrupolar
electric field is employed as shown in fig. 5 (b), ammonia molecules projected
in a beam perpendicular to the plane of the diagram will move towards the centre
of the quadrupolar arrangement if they are in the upper state whereas they will
be deflected outward and away from the axis if they are in the ground state.

It was in this way that Townes and his collaborators were able to produce a 
beam of excited ammonia molecules and focus them through a hole into a cavity 
resonator where the stimulated emission was much greater than the net 
absorption. 

This kind of physical separation is not possible in the solid state of course, and 
in the solid state masers another method of inversion has been employed®). This
relies on the fact that, if a very strong incident radiation field is employed, it is 
possible to interfere with the thermal distribution of the energy level population 
and to produce numbers of atoms in the excited level which are very close to the 




Fig. 6. Energy level diagram for three-level solid state maser. 


number in the ground state. Equations 2 and 3 above show that this will occur if
the other processes additional to stimulated emission cannot equalize the larger
overall probability of absorption as opposed to stimulated emission. This
phenomenon in which the numbers in an excited state are made more or less
equal to those in the ground state is termed ‘saturation’, and can be quite readily
achieved in the microwave region when working with some compounds at low
temperatures. The low temperatures reduce the interaction between the thermal
vibrations of the lattice and the energy level systems, and thus reduce the
relaxation processes which would tend to maintain thermal equilibrium.

We might now consider what will happen if we take a substance such as a 
transition-group atom with three energy levels spaced as shown in fig. 6. In the 
absence of any incoming radiation the population of the three levels will be 
governed by Maxwell-Boltzmann statistics and $n_1$ will be less than $n_2$ which will
be less than $n_3$. Now let incident radiation be applied of frequency $\nu_{13}$ and of
such a power level to saturate this transition. Then the numbers of atoms in the
$E_1$ and $E_3$ levels will be equalized and it therefore follows that the number of
atoms now in the top level will be greater than that in the level $E_2$ immediately
below. This process of pumping at a higher frequency to saturate an excited 
level can therefore be used to invert the population distribution between the 
higher level and an intermediate one. This principle has been employed in most 
of the solid state microwave masers which have been used in practical 
applications. 
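The effect of saturation pumping on a three-level system can be sketched numerically. The example below is a simplified illustration, not a description of any particular maser: the level spacings and the temperature are assumed values, the levels are labelled from the top downwards as in fig. 6, relaxation is ignored so that the middle level keeps its thermal population, and the helper names are hypothetical.

```python
import math

K_EV = 8.617e-5  # Boltzmann constant in eV/K


def thermal_populations(energies_ev, temperature_k, total=1.0):
    """Maxwell-Boltzmann populations for the listed level energies."""
    weights = [math.exp(-e / (K_EV * temperature_k)) for e in energies_ev]
    norm = sum(weights)
    return [total * w / norm for w in weights]


def saturate(populations, i, j):
    """Equalize levels i and j, as a strong pump on that transition would."""
    mean = 0.5 * (populations[i] + populations[j])
    result = list(populations)
    result[i] = result[j] = mean
    return result


# Energies of E1 (top), E2 (middle) and E3 (ground); assumed values in eV.
energies = [4.0e-5, 3.0e-5, 0.0]
n1, n2, n3 = thermal_populations(energies, temperature_k=4.2)

# Pump the E3 to E1 transition until it is saturated.
p1, p2, p3 = saturate([n1, n2, n3], 0, 2)

print(f"thermal:  n1={n1:.4f}  n2={n2:.4f}  n3={n3:.4f}")
print(f"pumped:   n1={p1:.4f}  n2={p2:.4f}  n3={p3:.4f}")
print("inversion between E1 and E2:", p1 > p2)
```

In this simplified picture the inversion between the top and the middle level appears because the middle level has been placed above the midpoint of the pumped transition; with the middle level below the midpoint the inversion would instead appear between the middle level and the ground state.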

This same principle (or modifications of it) has also been used to produce
the inverted population distribution in the lasers which have so far operated; 
these will be considered in detail in Section 6. 


5. GENERAL REQUIREMENTS FOR LASER OPERATION 

There are really three basic requirements for any system which is to operate 
as a laser. These can be summarized as follows:

1. A working medium must be available with a suitable energy level 
system. This energy level system must contain two levels with an energy
difference corresponding to the frequency of desired operation, and 
must also contain other levels between which saturation pumping or 
similar effects can be produced. 

2. Some means of producing an inverted energy level population must be 
devised, so that there are more atoms in the upper level of the optical 
transition than in the ground state, and thus there is a higher probability 
of stimulated emission rather than absorption. 

3. A sufficiently large number of atoms in the excited level must be available
to the incoming irradiation if any noticeable beam of coherent emitted
radiation is to be produced. This requirement is met in the microwave
masers by the use of a cavity resonator which effectively reflects the 
microwaves to and fro thousands of times and thus enables them to 
interact with many atoms in the upper state. Some such system must 
also be employed in any laser if a coherent beam of any measurable 
intensity is to be obtained. 

A suggestion for meeting this last requirement practically was made in 1958 by 
Townes and Schawlow(). Their paper represents the first step in the development
of the laser as an experimental possibility. They suggested that sufficient
path length could be obtained by using a reflecting system very similar to that 
employed in a normal Fabry—Pérot étalon. In this system two mirrors are aligned 
to be accurately parallel to one another to a very high degree so that a ray of 
light travelling normal to the planes of the mirrors will be reflected to and fro 
between them a large number of times. Thus if the gain, due to excess stimulated
emission in the activated medium, is greater than the losses which occur on 
reflection at the mirrors, there will be a net amplification of radiation during
each passage and hence a large amplitude will be built up as a result of the many
reflections. Such a system will produce a plane wave front with very little 
angular divergence, since any radiation which starts off at a slight angle from the 
normal to the mirrors will leave the activated medium after relatively few 
reflections and will be lost to the general beam of coherent radiation. This 
system as suggested by Townes and Schawlow should fulfil all the necessary 
requirements for a laser as outlined above; and provided a suitably activated 
medium can be inserted between the mirrors, coherent monochromatic radiation 
with highly directional properties should be produced, this being triggered off 
by a noise photon of the right frequency.
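The condition that the gain must exceed the mirror losses can be written as a simple threshold test. The sketch below assumes that the intensity grows exponentially with distance through the activated medium and that reflection at the two mirrors is the only loss; the separation of 100 cm and the reflectance of 0.90 are used purely as illustrative values, and the function names are hypothetical.

```python
import math


def round_trip_gain(gain_per_cm, length_cm, r_back, r_front):
    """Factor by which the intensity changes over one full to-and-fro passage."""
    return r_back * r_front * math.exp(2.0 * gain_per_cm * length_cm)


def threshold_gain_per_cm(length_cm, r_back, r_front):
    """Smallest gain coefficient for which the round-trip factor reaches unity."""
    return -math.log(r_back * r_front) / (2.0 * length_cm)


LENGTH_CM = 100.0   # mirror separation, illustrative
REFLECTANCE = 0.90  # reflectance of each mirror, illustrative

g_threshold = threshold_gain_per_cm(LENGTH_CM, REFLECTANCE, REFLECTANCE)
below = round_trip_gain(0.5 * g_threshold, LENGTH_CM, REFLECTANCE, REFLECTANCE)
above = round_trip_gain(2.0 * g_threshold, LENGTH_CM, REFLECTANCE, REFLECTANCE)

print(f"threshold gain coefficient: {g_threshold:.2e} per cm")
print(f"round-trip factor below threshold: {below:.3f}")
print(f"round-trip factor above threshold: {above:.3f}")
```

A round-trip factor above unity means that the radiation is reinforced at every passage and oscillation can build up; below unity any radiation present decays.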

Nearly all the successful lasers have employed optical systems of this type. 
The technical requirements for such Fabry—Pérot reflecting plates are very severe, 
since they must not only be aligned accurately parallel but must also be flat 
to a high degree of precision. These points have been considered in detail in a 
recent paper by Heavens in which he shows that for a reflectance of 0.90 at the
mirror’s surface the variation in the flatness of the surface must be less than
λ/30. The parallelism and accuracy of these reflecting windows can be tested
by setting up a system of fringes corresponding to those normally observed
with the Fabry-Pérot étalon.


6. EXPERIMENTAL LASERS 


The first laser to operate successfully did so in July 1960 and was designed 
and constructed by T. H. Maiman(*) of the Hughes Aircraft Company. He 
employed a ruby crystal as the activated medium and the parallel reflecting 


Fig. 7. Schematic diagram of ruby laser as constructed by Maiman(®). Labelled parts:
partially silvered mirror, fully silvered mirror, electronic flash tube, crystal,
laser output.


surfaces were obtained by polishing the two ends of this crystal very accurately
so that they were parallel to better than 6 seconds of arc. If a solid crystal is
used as the active medium the demands of homogeneity for the optical properties
of the crystal itself are also very great. This is one of the main limitations of such crystals.
A diagram of the ruby laser as designed by Maiman is shown in fig. 7, and it is
seen there that the back end of the crystal is completely silvered whereas the
front end is only partially silvered, and it is through this end that the pulse of
coherent radiation is emitted when laser action takes place.


The energy level system associated with the laser action is that of the impurity 
chromium atoms which replace some of the aluminium atoms of the aluminium 
oxide, this being the main constituent of ruby itself. In Maiman’s initial 
experiments the concentration of the impurity chromium atoms was about 
0.05 per cent and it is these atoms which produce the pale pink colour in the
crystal. The energy levels of these chromium atoms are shown in fig. 8, and the
crucial level in this system is level C which is a metastable state. If the atoms 
are excited into this level they will have a life time of about 10~? sec if they are 
not stimulated to emit beforehand. The transition from this level to the ground 
state gives a wavelength of 6943 Å, and it is this radiation which constitutes
the red fluorescence of the ruby crystals. In order to produce laser action in 
this system sufficient chromium atoms must be raised to this metastable level 
so that the stimulated emission produced when they revert to the ground state 
will be greater than the losses due to all types of absorption. 


Fig. 8. Energy levels of chromium atoms in a ruby crystal. (a) Before pumping. (b) After
pumping. Labels in the diagram: level A, level C (metastable state), excitation by
electronic flash, laser output.


The chromium atoms can be raised to this metastable level by exciting
them to the higher levels A or B and then letting them pass to level C, which
they will do relatively quickly. All that is required is, therefore, a source of
radiation with a frequency range which will cover the transition from the ground
state to level A, and also of sufficient power to excite a large number of the
atoms to the upper levels. The situation will then be as indicated in fig. 8 (b).
These atoms will then quickly fall to the metastable level C and then be available
for laser action when stimulated by incoming radiation of the correct frequency
or by a noise photon if oscillation rather than amplification is required. Maiman’s
particular contribution was the discovery that an electronic flash tube could
supply all the required pumping power and this could be wound around the
ruby crystal as shown in fig. 7.

The large amount of internal heating which occurs when such a flash tube is 
used means that this system could only be used to produce pulses of coherent 
radiation, and could not act as a continuous source. It was found that the power 
level of the electronic flash tube was quite critical; below a certain level only the 
incoherent red phosphorescence of the ruby was obtained, spread over its 
normal wide range of frequencies. Once the level of the electronic flash was 
increased above the critical value, an intense beam of red light would be emitted 
within a period of 10-*sec after the electronic flash, and this indicated that 
sufficient chromium atoms had then been pumped up to the excited state so that 
the stimulated emission they produced was greater than the losses which occurred 
at the ends of the optical system. This practical demonstration that coherent 
optical radiation could be produced via a process of stimulated emission confirmed 
the basic ideas and theories concerning laser action. Quite a large number 
of other lasers operating with similar single crystals have now been built and 
operated ® (), but all these do suffer from two inherent drawbacks. First, 
since the activated medium is in the solid state there is a definite limit to its 
homogeneity and therefore to the directional coherence of the emitted radiation. 
Secondly, the thermal stresses set up in the solid by the absorption of radiation 
from the electronic flash tube prevent continuous operation and hence the use 
of efficient modulation techniques. Both of these limitations can be overcome if 
a gaseous medium is used to supply the activated atoms and such a system will 
now be described in some detail. 

The first laser system using gas molecules which operated successfully was 
that proposed by Javan of the Bell Laboratories in 1959 and successfully operated 
by him in 1961). This produces a continuous coherent output and only requires 
50 watts of input power. 

The optical system is again based on the Fabry—Pérot reflecting plates; 
these are mounted at either end of a glass tube containing the gas, and can be
adjusted for parallelism by metal bellows which connect them to the main tube 
(fig. 9). The energy level system employed is that of the neon atom as shown on 
the right-hand side of fig. 10. The crucial level in this case is the 2S level at the 
top of the system, and laser action is produced by stimulated emission which 
takes place between this level and the next, the 2P level. The necessary large 
number of excited atoms in this 2S level is obtained by an indirect method 
employing the presence of helium gas. Helium has a long lived metastable state, 
i.e. 2³S on the left of fig. 10, and atoms in the ground state of the helium atom
can be excited to this metastable state by such processes as electron impact in a 
gas discharge. Since the excited helium atoms will then be in the same energy 
state as that corresponding to the 2S level of the neon atoms, it is very simple 
for them to transfer their energy by a direct collision with the neon atoms and 
excite a neon atom directly to the top 2S level. In a suitable mixture of helium 
and neon gas it is, therefore, possible to build up a dynamic equilibrium in which 
there are a large number of atoms in the 2S level of neon, even although this is 
not a metastable state itself. It may be noted that two features distinguish this 
energy level system from that used in the ruby laser. First the excess population
in the higher level associated with the laser transition is now achieved not because
of the particular properties of this level itself, but because it can be fed from a 
level of equal energy in another atom with metastable properties. The second 
feature is that the pumping energy is supplied in this case not directly by a 
photon of higher energy as in the electronic flash tube, but by collision with an 
accelerated electron in a gas discharge. It is possible in principle, of course, 
to use any method of excitation. 


Fig. 9. Schematic diagram of gas laser designed by Javan et al.(°). Labelled parts:
end mirrors, bellows, micrometer for adjusting parallelism of mirrors, helium-neon
mixture, R.F. generator for excitation of discharge.


It should be pointed out at this stage that much background research is 
always required before a satisfactory system can be developed. In the case of 
this first gas laser, the transfer cross-sections corresponding to the helium-neon 
collisions were studied in detail in afterglows produced by a pulsed radio frequency 
discharge containing varying amounts of helium and neon. The time variation 
of the densities of the 2S and 2P levels was also observed by fluorescent decay of 
these particular levels. The possibility of direct transfer from the 2³S level
of the helium atom to the 2P levels of the neon was ruled out by another set of 
experiments involving relative intensity measurements at fixed times in the after- 
glow. It was only by painstaking experiments of this type that the correct 
conditions for optimum operation of the laser were determined beforehand, 
and the actual device could be designed precisely. 

The chamber containing the helium-neon mixture consisted of a quartz tube
80 cm long and of inside diameter 1.5 cm. The quartz tube is terminated at
each end with the larger metal chambers holding the Fabry-Pérot reflecting
plates which can be adjusted by the flexible bellows (fig. 9). The separation of
these plates themselves is 1 metre and the discharge is excited by external
electrodes fed from a 28 Mc/s generator. Laser oscillations have been observed
at five different wavelengths, i.e. 11180, 11530, 11600, 11990, and 12070 Å.
The strongest oscillation occurs at the 11530 Å wavelength and has an output
power of 15 milliwatts.
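The figures quoted for the helium-neon laser can be put on a common footing with a short calculation. The sketch below converts the five wavelengths to frequencies and compares the strongest output with the 50 watts of input power mentioned earlier; the variable names are illustrative, and the ratio is only a crude overall efficiency for the strongest line.

```python
C = 2.998e8  # speed of light in m/s

wavelengths_angstrom = [11180, 11530, 11600, 11990, 12070]

# Frequency of each oscillation in Mc/s.
frequencies_mc = [C / (w * 1e-10) / 1e6 for w in wavelengths_angstrom]

INPUT_POWER_W = 50.0     # input power quoted for the discharge
OUTPUT_POWER_W = 15e-3   # output quoted for the strongest line

efficiency = OUTPUT_POWER_W / INPUT_POWER_W

for wavelength, frequency in zip(wavelengths_angstrom, frequencies_mc):
    print(f"{wavelength} angstrom  ->  {frequency:.3e} Mc/s")
print(f"output / input power: {efficiency:.1e}")
```

All five lines lie in the near infra-red, at frequencies of about 2.5 to 2.7 × 10^8 Mc/s, and only a few parts in ten thousand of the input power appear in the strongest coherent line.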

Both of the lasers described so far are in reality light oscillators rather than
light amplifiers. In each case the whole process of stimulated emission is initiated
by a noise photon present in the active medium and the gain and feedback of
the system are sufficiently large for such a noise photon to produce oscillations.
But in a large number of applications an amplifier is more useful than an oscillator,
and the successful design and construction of such a light amplifier has recently
been reported.


Fig. 10. Energy level systems of helium and neon atoms as used in the Javan laser.
Successive energy transitions are indicated by the figures in circles. The vertical
scale is energy in electron volts, with the helium levels on the left and the neon
levels on the right; the laser output transition is marked.


This coherent light amplifier was designed and constructed by Jacobs, Gould
and Rabinowitz®). It employed caesium vapour as the active medium and was
thus of the gas laser type rather than the solid-state type. The caesium vapour
was excited by selective optical pumping following the method suggested by
Schawlow and Townes). In this method of excitation the caesium vapour is
pumped with the wavelength of the helium radiation 3888 Å. The helium
discharge lamp emits an intense radiation at this wavelength and it also overlaps
three of the caesium energy levels as shown in fig. 11. It can be seen from this
figure that the incident radiation from a helium discharge lamp will excite
caesium atoms from the 6S1/2 to the upper 8P1/2 energy level. The population
of the 8P1/2 level can, therefore, be made larger than that of either the 8S1/2
or the 6D3/2 levels. It should, therefore, be possible to obtain laser action at
either of the wavelengths corresponding to these two transitions which are 7 μ
and 3 μ respectively. The experimental measurements were actually carried
out at the wavelength of 3.2 μ since better techniques of detection are available
here employing lead sulphide cells.



In order to prove that coherent light beams were being amplified, it was
necessary to have a source of coherent radiation to feed the amplifier. This
source was also formed from activated caesium vapour pumped by the same
helium line; in other words, a laser which oscillated rather than amplified. The
output from this source was modulated at a frequency of 105 cycles per sec and
fed to the laser amplifier itself which consisted of a caesium cell of 90 cm length
illuminated by a long helium discharge lamp mounted behind it. The output


Fig. 11. Energy level system of caesium atoms as used in light amplifier. The helium
3888 Å line which acts as a pump is shown as a heavy line. Levels shown: 6S1/2,
8S1/2, 8P1/2 and 6D3/2, with the 0.388 μ helium pumping wavelength and the laser
output marked.


from the amplifying cell was fed to the lead sulphide detector, and the modulation 
on the light beam was detected in a phase-sensitive amplifier with a final time 
constant of a third of a second. Signal-to-noise ratios of 100 : 1 were obtained
corresponding to a power of 10~° watts of coherent radiation. The amplifying 
action of the caesium cell was confirmed by switching off the various activating 
elements in turn, and it was shown that a gain of 6 per cent was obtainable 
with the 90 cm cell length. A theoretical treatment shows that the amplification
coefficient is related to the populations of the two energy levels concerned by the
equation


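$$ k(\nu) = \frac{h\nu}{c}\, S(\nu)\, B \left[ N_2 - \frac{g_2}{g_1} N_1 \right] $$

where $S(\nu)$ is the normalized line shape function and $B$ is the coefficient of induced
emission. For a Doppler profile on the line this expression reduces to

$$ k(\nu_0) = \left[ \frac{\log_e 2}{\pi} \right]^{1/2} \frac{A \lambda_0^4}{4\pi c\, \Delta\lambda_D} \left( N_2 - \frac{g_2}{g_1} N_1 \right) $$

where $A$ is the spontaneous emission coefficient, $g_1$, $g_2$ are the statistical weights and
$\Delta\lambda_D$ is the Doppler width.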
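Two small calculations may help to fix ideas here. The sketch below first converts the measured gain of 6 per cent in 90 cm into an amplification coefficient, assuming simple exponential growth along the cell. It then evaluates a Doppler-profile expression of the usual textbook form, in which the coefficient is proportional to $A \lambda_0^4 / (4\pi c\, \Delta\lambda_D)$ times the weighted population difference; that form is an assumption about the expression intended in the text. The values given to the emission coefficient and the Doppler width are arbitrary placeholders chosen only to exhibit the sign of the result, the populations are the relative values of table 1, and the statistical weights are taken as 2J + 1.

```python
import math

C_CM = 2.998e10  # speed of light in cm/s


def measured_gain_coefficient(fractional_gain, length_cm):
    """Coefficient k in cm^-1, assuming output / input = exp(k * length)."""
    return math.log(1.0 + fractional_gain) / length_cm


def doppler_peak_gain(a_coeff, lambda0_cm, doppler_width_cm,
                      n_upper, n_lower, g_upper, g_lower):
    """Line-centre amplification coefficient for a Doppler profile (assumed form)."""
    shape = math.sqrt(math.log(2.0) / math.pi)
    weighted_difference = n_upper - (g_upper / g_lower) * n_lower
    return (shape * a_coeff * lambda0_cm ** 4
            / (4.0 * math.pi * C_CM * doppler_width_cm)
            * weighted_difference)


k_measured = measured_gain_coefficient(0.06, 90.0)

# Placeholder line parameters: NOT measured values.
A_PLACEHOLDER = 1.0e6       # spontaneous emission coefficient, s^-1
WIDTH_PLACEHOLDER = 1.0e-9  # Doppler width in cm
LAMBDA0 = 3.2e-4            # 3.2 microns expressed in cm

# 8P1/2 (weight 2) above 6D3/2 (weight 4), relative populations 100 and 3.
gain_inverted = doppler_peak_gain(A_PLACEHOLDER, LAMBDA0, WIDTH_PLACEHOLDER,
                                  100.0, 3.0, 2, 4)
# The same levels with the populations exchanged.
gain_normal = doppler_peak_gain(A_PLACEHOLDER, LAMBDA0, WIDTH_PLACEHOLDER,
                                3.0, 100.0, 2, 4)

print(f"measured coefficient: {k_measured:.2e} per cm")
print("inverted populations give amplification:", gain_inverted > 0)
print("normal populations give absorption:", gain_normal < 0)
```

The measured gain corresponds to a coefficient of about 6.5 × 10^-4 per cm, and the sign of the calculated coefficient follows the sign of the weighted population difference.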

The crucial parameter in the above expressions is, of course, the difference
in the population of the two levels ($N_2 - N_1$). From the energy level populations
when the system is pumped by intense radiation from the helium lamp (table 1)
it should be possible to produce 30 times as great a population density in the
8P1/2 level as in the 6D3/2. Although the experimentally observed gain was not
as good as that predicted by the simple theory above, the fact that amplification
of light could be produced in this way does complete the second essential step
in the design and construction studies of laser techniques.


Table 1
Relative population of caesium levels when pumped with 3888 Å helium light.

Level        8P1/2   8P3/2   8S1/2   6D3/2   5D3/2
Population    100      30      20       3     200


7. RESULTS OBTAINED AND FUTURE APPLICATIONS 


From the general point of view, the ideas of both the amplification and 
production of coherent light beams have been experimentally confirmed. There 
is no doubt that more efficient amplifiers with higher gains will rapidly be 
developed, and these should play a very important part in future communication 
networks. The enormous information capacity of coherent beams in the visible 
region has already been stressed in the introduction, and a simple calculation 
shows that it would be possible to include sufficient telephone circuits on one 
beam of coherent radiation spread across the visible region to provide separate 
communication channels for everybody in the world at once. As well as the 
enormous information bandwidth that coherent radiation in this part of the 
spectrum can carry, it also has one other extremely valuable feature for communi- 
cation purposes. This is the high directivity associated with the plane waves 
emitted by such a device. The wave fronts emitted from the ruby type lasers 
which have been so far constructed are limited by inhomogeneities in the crystals
themselves, but if such effects can be overcome the theoretical limits on the 
divergence of the emitted beam would be of the order of 5 x 10~° degrees. In 
fact, the angular spread of the emergent beam from the helium-neon gas maser 
was found to be less than one minute of arc and this indicates the high order of 
directivity which is immediately available even with these experimental lasers. 
Such high directivity means that these beams can be transmitted over very 
great distances without losing their energy density. This fact may prove to be
extremely important for inter-satellite or inter-planetary communication.
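The "simple calculation" of telephone capacity mentioned above can be sketched as follows. All three input figures are assumptions chosen for illustration (the article gives none of them): a visible band of roughly 4.0 × 10¹⁴ to 7.5 × 10¹⁴ cycles per sec, a 4 kc/s speech channel, and a world population of about 3.1 × 10⁹ around 1961.

```python
# Every constant here is an illustrative assumption, not a value from the article.
VISIBLE_LOW_HZ = 4.0e14     # red end of the visible region (about 750 nm)
VISIBLE_HIGH_HZ = 7.5e14    # violet end of the visible region (about 400 nm)
CHANNEL_WIDTH_HZ = 4.0e3    # nominal width of one telephone speech channel
WORLD_POPULATION = 3.1e9    # rough world population in the early 1960s

def telephone_channels(low_hz, high_hz, channel_hz):
    """Number of channels of width channel_hz that fit between low_hz and high_hz."""
    return (high_hz - low_hz) / channel_hz

channels = telephone_channels(VISIBLE_LOW_HZ, VISIBLE_HIGH_HZ, CHANNEL_WIDTH_HZ)
print(f"channels available : {channels:.2e}")
print(f"channels per person: {channels / WORLD_POPULATION:.1f}")
```

On these assumptions the band holds several tens of channels per head of population, which is the sense of the claim in the text.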

The other striking practical difference between this type of radiation and that 
emitted by the more usual incoherent sources is the energy density that can be 
obtained by suitably focusing the coherent beams. It is impossible to increase 
the absolute brightness of an image formed from incoherent radiation above that
of the emitting object, whatever optical systems are employed in the intervening
region. On the other hand, if coherent sources of radiation are available, the 
wave fronts can be focused and made to constructively interfere and thus 
produce far higher energy densities than those which occur at the surface of the 
emitting medium. For instance, the four milliwatt output obtained from the 
gas laser could be focused into a spot of 10⁻⁴ cm diameter by a lens of 1 cm
focal length and the radiant power density at the focus will then be 1.5 million
watts per square cm. Such a figure could easily be increased by several orders of 
magnitude when larger and more efficient lasers are built. This intense power 
density has already been demonstrated quite strikingly with the ruby laser by 
focusing its output on to a carbon target when a brilliant blue-white flame is 
produced from the incandescent carbon raised to over 8000 °C. A large number
of practical applications might be expected to follow from the availability of such
power concentration, among which are precise welding techniques in situ and
medical applications involving the destruction of selected tissue in well defined 
areas. The investigation of the basic properties of matter itself in the presence of 
the extremely high electric and magnetic fields associated with these radiation 
densities will also prove an extremely interesting study. 

It would probably be fair to predict in conclusion, however, that the major 
applications and importance of lasers and their associated devices will probably 
be in the field of communications in the immediate future. The potentialities 
of such coherent beams are only just beginning to be appreciated, and although 
there are some practical drawbacks, such as the fact that for terrestrial communi- 
cation any such beams will have to be enclosed within pipes to prevent their 
dissipation by atmospheric weather conditions, there will nevertheless be a very 
wide field of possible applications which immediately spring to mind. That this
is already so, can be seen from the very great interest and research effort that is 
already being concentrated on these devices in both this country and the U.S.A. 


REFERENCES 


(1) Gordon, J. P., Zeiger, H. J., and Townes, C. H., 1954, Phys. Rev., 95, 282.
(2) Bloembergen, N., 1956, Phys. Rev., 104, 324.
(3) Schawlow, A. L., and Townes, C. H., 1958, Phys. Rev., 112, 1940.
(4) Heavens, O. S., 1961, Hilger Journal, 6, 63.
(5) Maiman, T. H., 1960, Nature, 187, 493.
(6) Collins, R. J., Nelson, D. F., Bond, W., Garrett, C. G. B., and Kaiser, W., 1960,
Phys. Rev. Letters, 5, 303.
(7) Sorokin, P. P., and Stevenson, M. J., 1960, Phys. Rev. Letters, 5, 557.
(8) Javan, A., Bennett, W. R., Jr., and Herriott, D. R., 1961, Phys. Rev. Letters, 6, 106.
(9) Jacobs, S., Gould, G., and Rabinowitz, P., 1961, Phys. Rev. Letters, 7, 415.


The Author: 


Professor David Ingram is Professor of Physics at Keele University and went there in 
1959 from the Electronics Department at the University of Southampton where he had 
been successively a Research Fellow, Lecturer and Reader. He graduated at Oxford 
University and stayed on to do four years research work there under the supervision of 
Professor Bleaney. His research interests have been concentrated in the field of microwave 
spectroscopy and electron resonance studies, and as well as publications in this field he
has also written two books entitled Spectroscopy at Radio and Microwave Frequencies
and Free Radicals as Studied by Electron Spin Resonance.


Ampère as a Contemporary Physicist
by R. A. R. TRICKER


The basis of electrodynamics, what Maxwell called its cardinal principle, 
lies in the laws which govern the action of one current-carrying conductor upon 
another. These can be expressed in several forms which are more or less
equivalent at bottom and capable of being transformed in many cases from one 
into the other. The earliest of these was the law of action of current elements, 
according to which the effect of an electric circuit is to be looked upon as the sum 
of contributions from the current elements of which it is composed. For com- 
plete circuits this law can be transformed directly into that of the equivalent 
magnetic shell and from this the ‘ work law’ can be derived, if it is preferred to 
work from that. The name of Ampère is sometimes associated with the law
of action of current elements in a general way. Sometimes his name is reserved
for a particular expression. Sometimes the names of Laplace, Biot, Savart or
Grassman are employed. There is little system about it.


Many of the older books referred to the well known expression

$$\frac{i\, ds\, \sin\theta}{r^2} \qquad (1)$$

for the magnetic field H generated by an element ds of a circuit carrying a
current i, at a distance r from the element (θ being the angle between the element
and the line joining it to the point in question) as Ampère’s Law¹. Occasionally
it was called Laplace’s Law². In books written more recently it is generally
referred to as the law of Biot and Savart. Some call it Grassman’s formula.
It was, as most elementary books explain, Oersted who first demonstrated 
the action of an electric current upon a magnet. He had announced his intention
of looking for the effect as early as 1807 but was still demonstrating negative
experiments as late as 1819³. Others were also engaged in the search and the
effect looked for seems to have been the orientation of an electric pile suspended 
by a thread, or its action on a magnet, but on open circuit. In 1819-20 Oersted 
tried the effect of closing the circuit but as he held the connecting wire at right 
angles to the magnet, again he obtained no result. It was at the end of a lecture 
in which he demonstrated this negative result that he thought of holding the 
wire parallel to the compass needle and first obtained a positive result. At 
first it was weak and uncertain but he soon improved upon it and finally 
announced his finding publicly in July 1820⁴. He did not determine the quantitative
laws governing the phenomenon but restricted himself to a qualitative
description of his observations. 


¹ This is so, for example, in Jeans’s Electricity and Magnetism, Pidduck’s Treatise on
Electricity, Whetham’s Theory of Electricity, and Gray’s Dictionary of Physics.

² For example in Hadley’s Magnetism and Electricity for Students. Philip Parker
(Electricity and Magnetism) gives it as Ampère’s or Laplace’s law and states that it was
given by both.

³ See Whittaker, History of the Theories of Aether and Electricity, p. 81.

⁴ Oersted: Translations in Thomson’s Annals of Philosophy XVI 1820, p. 273, and
in Journ. of Tel. Engineers, vol. 5, p. 459, 1876.




After Oersted’s break-through results came thick and fast. It is perhaps 
a pity that his name is not associated with the law to which his observations 
first gave rise, in addition to those whose contributions were also but limited, 
but when so many have been concerned it is not possible to commemorate all. 
Arago, who had just returned from abroad, described Oersted’s experiments to a 
meeting of the French Académie des Sciences on the 11th September 1820. 
On the 18th September, just a week later, André Marie Ampère (1775-1835)¹
showed that two parallel wires carrying currents attracted each other if the 
currents flowed in the same direction in both wires and repelled each other if 
the currents were in opposite directions. 

On the 30th October 1820, Biot and Savart² announced the results of experiments
in which they determined the strength of the magnetic field generated
by a current in a long straight wire. They timed the oscillation of a small
magnet held at various distances from the wire, the magnetic field of which was
arranged sometimes to oppose and sometimes to reinforce that of the earth.
Sometimes the earth’s field was annulled by means of an auxiliary magnet.
This achievement is to be measured in light of the fact that their only source
of electric current was the electric pile and elaborate measures had to be taken
to allow for its variations. It is remarkable that they succeeded at all under such
difficulties. They “were led to the following result, which expresses accurately
the actions exerted on a molecule of austral or boreal magnetism” (i.e. a north
seeking or south seeking pole) “when placed at any distance from a thin cylindrical
wire of indefinite length rendered magnetic by the voltaic current. From the
point where the molecule is, draw a perpendicular to the axis of the wire; the
force which acts on the molecule is perpendicular to this line and to the axis
of the wire. Its intensity varies inversely as the distance”.

On the basis of this result Laplace deduced that the magnetic field produced 
by an element of current must follow a law of inverse squares. There seems 
little doubt about this though he does not appear to have written any paper on 
the subject—at least none appears in his collected papers. The fact is mentioned 
both by Ampère³ and Biot⁴ himself.
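Laplace’s deduction can be checked numerically. The sketch below sums inverse-square contributions of the form i ds sin θ / r² along a long straight wire and shows that the total falls off as the simple inverse of the distance, as Biot and Savart observed. The wire is taken as finite but very long (a half-length of 1000 units, an arbitrary choice), and the current is set to 1 in electromagnetic units.

```python
import math

def field_of_straight_wire(current, distance, half_length, steps=100_000):
    """Sum i*ds*sin(theta)/r^2 over a straight wire of length 2*half_length.

    The field point lies at perpendicular distance `distance` from the
    middle of the wire; the sum uses the midpoint rule.
    """
    ds = 2.0 * half_length / steps
    total = 0.0
    for n in range(steps):
        s = -half_length + (n + 0.5) * ds      # position of the element
        r2 = s * s + distance * distance       # squared distance to the field point
        sin_theta = distance / math.sqrt(r2)   # angle between element and line to point
        total += current * ds * sin_theta / r2
    return total

for d in (1.0, 2.0, 4.0):
    h = field_of_straight_wire(1.0, d, half_length=1000.0)
    print(f"distance {d}: field {h:.5f}, field x distance {h * d:.5f}")
```

The product of field and distance stays close to 2i, so an inverse-square law for the element is compatible with an inverse-distance law for the whole wire.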

While the experimentation of Biot and Savart had been magnificent, their own
theoretical interpretation of the results was much less happy. Speaking of the
action of a current element they said:⁵ “La nature de son action est la même que
celle d’une aiguille aimantée qui serait placée sur le contour du fil⁶ dans un sens


¹ Annales de Chimie et de Phys. Ser. II, XV, p. 59, 1820.

² Annales de Chimie et de Phys. XV, 1820, p. 222.

³ Mém. d’Acad. 30 Aug. 1826, p. 232.

⁴ Précis Elémentaire de Phys. II, p. 122. “C’est ce qu’a fait M. Laplace. Il a déduit
de nos observations que la loi individuelle des forces élémentaire exercées par chaque
tranche du fil conjonctif était la raison inverse du carré de la distance, c’est à dire précisément
la même que l’on sait exister dans les actions magnétique ordinaire. Cette analyse
montrait que, pour compléter la connaissance de la force il restait encore à déterminer si
l’action de chaque tranche du fil était la même dans toutes les directions à distance égale
ou si elle était plus énergique dans certains sens que dans d’autres.”

⁵ Ann. de Chimie et de Phys. XV, 1820, p. 222.

⁶ This is expressed more explicitly in the Journ. de Phys., de Chim., d’Histoire Naturelle
et des Arts XC, 1820. “La nature de son action est la même que celle d’une aiguille
aimantée qui serait placée tangentiellement au contour du fil, de sorte qu’une particule de
magnétisme austral et une particule de magnétisme boréal en seroient sollicitées en sens
contraires, quoique toujours suivant la même droite par la construction précédente.”




déterminé et toujours constant par rapport à la direction du courant voltaïque”.
Ampère¹ pointed out that the field of a small magnetic needle varied inversely
as the cube and not the square of the distance and that if the needle was placed 
tangentially to the cross section of the wire, as it would have to be to produce 
a field at right angles to the axis of the wire and the perpendicular to the point 
under consideration, a complete circle of such needles round the circumference 
of the cross section would form a closed loop producing no external field at all. 

Biot went on to measure the magnetic field of a long wire which had been bent
at one point. The two pieces (see fig. 1) were held in a vertical plane at equal
angles ZMH, CMH to the horizontal and the forces on a magnetic pole placed
at points along the horizontal line mM lying in this plane and passing through the
point at which the wire was bent, were measured, as before. Biot reported:
“J’ai trouvé ainsi que, pour l’un comme l’autre” (i.e. for the bent wire and the
long straight wire) “l’action est réciproque à la distance . . . mais l’intensité
absolue était plus faible pour le fil oblique que pour le fil droit, dans la proportion
de l’angle ZMH à l’unité. Ce résultat, analysé par le calcul, m’a paru indiquer
que l’action de chaque élément du fil oblique sur chaque molécule m de
magnétisme austral ou boréal est réciproque au carré de sa distance μm à cette
molécule, et proportionnelle au sinus de l’angle mμM formé par la distance μM
avec la longueur du fil.”²

Fig. 1.


¹ Mém. d’Acad., 1823, pp. 357, 360.
² Précis Elémentaire de Phys., Tome II, pp. 122 and 123. Quoted by Ampère, Mém.
d’Acad., 1823, pp. 281-282.




Fig. 2. Ampére’s early attempts at finding the law of interaction of current elements 
employed apparatus in which the force experienced by the circuit carrying a current 
due to another was directly measured. This is Ampère’s diagram of his apparatus
for investigating the attraction and repulsion between parallel currents. CD is 
pivoted at E and F and swings so that it lies in the same horizontal plane as AB along 
which a second current passes. 


Fig. 3. Ampère’s diagram of an apparatus he designed to measure the force exerted by a
current in SR upon a current in the suspended wire BC. He soon abandoned this
type of investigation because of the difficulty in maintaining steady currents.




Unfortunately there is again an error of calculation. Ampère pointed out
that the law which, in fact, is correct, does not follow from the results stated by
Biot. To obtain the law it is necessary for the force to vary, not as the angle
ZMH but as the tangent of half that angle, as M. Savary had shown (Mém.
d’Acad. 3rd February 1823). Biot subsequently corrected this mistake.
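Savary’s correction can be illustrated numerically. The sketch below sums i ds sin θ / r² along the two semi-infinite arms of the bent wire and compares the result with that for a straight wire. It rests on an assumption about the geometry of fig. 1, which is not reproduced here: the pole is taken to lie on the horizontal line on the side of the bend from which the arms recede, so that each arm makes the angle π − ZMH with the line from M to the pole. Function names are illustrative.

```python
import math

def field_of_arm(current, x, alpha, steps=20_000):
    """Field at distance x from the bend M due to one semi-infinite arm.

    alpha is the angle ZMH between the arm and the horizontal; the pole is
    ASSUMED to lie on the opposite side of M, so the arm recedes from it.
    The substitution s = x*t/(1-t) maps the infinite arm onto 0 <= t < 1.
    """
    total = 0.0
    dt = 1.0 / steps
    for n in range(steps):
        t = (n + 0.5) * dt
        s = x * t / (1.0 - t)                    # distance of the element from M
        ds = x * dt / (1.0 - t) ** 2             # length of the element
        r2 = s * s + 2.0 * s * x * math.cos(alpha) + x * x
        sin_theta = x * math.sin(alpha) / math.sqrt(r2)
        total += current * ds * sin_theta / r2
    return total

def ratio_to_straight_wire(alpha, x=1.0, current=1.0):
    """Field of the bent wire (two arms) divided by that of a straight wire, 2i/x."""
    return 2.0 * field_of_arm(current, x, alpha) / (2.0 * current / x)

for degrees in (30, 60, 90):
    a = math.radians(degrees)
    print(degrees, round(ratio_to_straight_wire(a), 5),
          "tan(half angle) =", round(math.tan(a / 2), 5),
          "angle/90 =", round(degrees / 90, 5))
```

The computed ratio follows the tangent of half the angle rather than the angle itself, although the two agree at 90° and differ only moderately in between, which may explain how the slip arose.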

Ampère’s main paper occurs in the Mémoires de l’Académie de Paris for
1823. This volume, however, was not published until 1827 and in it, when it
was printed, were incorporated later communications so that the order of
precedence becomes confused. The date of writing, however (the 30th August
1826), is mentioned in the paper (p. 368). In it are included the results of work
already communicated earlier.

Ampère showed himself master of both the experimental and the theoretical
situation. His experimentation was brilliant and his theoretical treatment of 


Fig. 4. Ampère’s apparatus for showing that a wire could be bent
without affecting the magnetic field it produced at distant points. The second wire, which
is bent into zig-zags, is mounted at the same distance from the suspended wire GH
as the straight wire PQ. The same current flowing in these two wires produces
no deflection in GH. By this experiment Ampère justified his resolution of current
elements.




his results masterly. The question which he attacked was this. If the action
of one circuit carrying a current upon another is interpreted as the resultant of
interactions between pairs of current elements in each circuit, what must be the
law of action between them? In his experimental arrangements he soon
abandoned attempts to use steady currents in apparatus like that shown in
fig. 2 and fig. 3 and designed experiments working on null principles. His
experiments consisted in devising balanced situations in which a suspended
circuit was not deflected by a nearby fixed one when current was switched on in
both.


Fig. 5. Ampére’s apparatus for showing that the force experienced by a short length of 
circuit is at right angles to the conductor. The wire AA’ in the form of an arc of a
circle is suspended by means of the arm OG and balanced by a counterpoise at Q. 
A current is sent along AA’ via the mercury cups N and N’ filled brimming full. 
AA’ experiences no deflection from a second circuit when the supporting arm OG 
lies along a radius but it does if this is not so. 


His theory was based upon four such experiments. The first showed that a
current doubled back along itself produced no magnetic field at external points;
the second (fig. 4) that a wire bent into the form of small zig-zags produced the
same effect as a straight wire at the same distance; the third (fig. 5) that the force
on an element of a circuit when acted on by a closed circuit, was always at right
angles to the element; the fourth (fig. 6) that if the linear dimensions of a pair of
circuits and their distances from one another were multiplied by a constant
factor the interaction between them was unaltered. The whole investigation
lasted six years and was finally read to the Academy on the 30th August 1826.¹

In the theoretical field Ampére conceived his problem to be as follows. 
The third experiment shows that the force upon an element of current exerted 


¹ For a simple deduction of the Biot-Savart formula from Ampère’s experiments see
On the Teaching of Electricity in Schools (John Murray). 




by another current forming a complete circuit is at right angles to the element. 
Can a law of action between current elements be deduced in which the forces 
each exerts upon the other are equal and opposite and have as their line of action 
the line joining the elements, thus agreeing with Newton’s third law and yet 
give a force at right angles to one element when integrated round the complete 
circuit of which the other element forms a part? In this Ampére really 
restricted himself more than is strictly necessary. To satisfy Newton’s law 



Fig. 6. Ampére’s apparatus for showing that the force between two circuits is independent 
of the linear dimensions as long as the geometry is not otherwise altered. The radii 
of the three horizontal circles are in geometrical progression. The centre circle is 
free to move being pivoted by mercury cups I and N. A current sent through the 
three circles causes no movement of the centre circle if its centre divides the line 
OO’ in the common ratio of the radii. The force exerted upon the suspended circle 
by the larger circle at the larger distance is balanced by that exerted by the smaller 
circle at a correspondingly smaller distance. 


it is necessary for the forces to be equal and opposite and to have the 
same line of action, but it is not necessary that this line of action should pass 
through the elements themselves. 

As early as the 4th December 1820 he announced that the required law must
be of the form

$$F = \frac{i\, i'\, ds\, ds'}{r^2} \left( \sin\theta \sin\theta' \cos\omega + k \cos\theta \cos\theta' \right) \qquad (2)$$

where i ds and i′ ds′ are the two current elements, r the distance between them,
θ and θ′ the angles each makes with the line joining them (in the direction of r
increasing in each case), ω the angle between the two planes determined by the
line joining the elements and the directions of the elements themselves and k is a
constant (see fig. 7).




It was not until the great paper of 1826¹ that Ampère gave the value of k to
be one half and his formula therefore,

$$F = \frac{i\, i'\, ds\, ds'}{r^2} \left( \sin\theta \sin\theta' \cos\omega + \tfrac{1}{2} \cos\theta \cos\theta' \right) \qquad (3)$$


Fig. 7. 


In this he employed so-called electrodynamic units which are √2 times
smaller than electromagnetic units. If, therefore, i and i′ are measured in
electromagnetic units Ampère’s formula becomes

$$F = \frac{i\, i'\, ds\, ds'}{r^2} \left( \cos\theta \cos\theta' + 2 \sin\theta \sin\theta' \cos\omega \right) \qquad (4)$$

or as it is more usually expressed,

$$F = i\, i'\, ds\, ds' \left( \frac{1}{r^2} \frac{dr}{ds} \frac{dr}{ds'} - \frac{2}{r} \frac{d^2 r}{ds\, ds'} \right) \qquad (5)$$

The deduction of expression (3) occupied roughly the first half of the paper 
of 1826. The second half was devoted to a theory of magnetism which ascribed 
it to circulating molecular currents which, in the aggregate, would be equivalent 
to a solenoidal current flowing round the magnet. This was shown to account
qualitatively for all the phenomena of magnetism and by the law of action of 
current elements just deduced, also quantitatively for forces between magnets 
and between them and electric currents. 

Ampère used his law of action of current elements to determine the force
which a solenoid PQ (fig. 8), infinite in length in one direction (the direction
of Q), exerts upon a current element i ds. He showed that this force was
proportional to

$$\frac{i\, ds\, \sin\omega}{r^2},$$

that its line of action passed through ds and was at right angles to the plane
containing r and ds. Since this force was the resultant of a large number of
forces between ds and the current elements of the solenoid, which themselves
experienced equal and opposite forces, it followed that the force acting upon the
solenoid due to ds was equal and opposite to that acting upon ds itself and had the
same line of action. In other words the force due to ds acting alone would
possess a moment about the end of the solenoid.


¹ Mém. de l’Acad., 1826.




Ampére then proceeded to integrate his expression round the circuit of which 
ds formed a part and showed that for a complete circuit the resultant force exerted 
upon the solenoid passed through the end of the solenoid, which thus behaved 
as one end of an infinitely long magnet. By considering two such solenoids 
with currents circulating in opposite directions and displaced relatively to one 
another he was able to work out similarly the behaviour of a magnet of finite 
length when placed near an electric current. In this way he established the 
equivalence of his law of action of current elements with the action between 
a current and a magnet as investigated by Biot and Savart and others. Remem- 
bering that Ampére had no more to go upon than Newton’s laws in mechanics, 
Oersted’s qualitative observations and his own experimental results his achieve- 
ment is remarkable in the extreme. It is small wonder that Maxwell referred 
to him as the Newton of electricity. 


Fig. 8. 


Ampére’s interpretation of the interaction of two electric circuits as being 
the sum of the interactions in pairs of the constituent current elements, each 
member acting upon the other with equal and opposite forces having as their line 
of action the line joining the elements has since been criticized by many writers. 
These criticisms have continued down to the present day. What has been 
criticized has not been so much the idea of current elements, though that has 
not been altogether exempt, as the particular law which Ampére formulated. 
It is interesting that such a scientific controversy should have lasted for so long 
that it brings Ampére into the field of contemporary physics. This is related 
to the fact that while it is possible to render a small part of a circuit movable 
and thus measure the force exerted upon it, it is not possible to isolate an element 
of the other circuit which gives rise to this force and determine its contribution 
to the total. It has therefore not been possible to subject Ampére’s expression 
to the test of experiment. It is, of course, easy to test the result of the integration 
of the expression round a complete circuit, at least in certain cases, but many 
different expressions can be framed which yield the same result on integration
and experiment cannot distinguish between them. The reader may well
incline to the view that the discussion of a question which cannot even in 
principle be put to the test of experiment cannot be looked upon as scientific. 
Nevertheless this has not prevented Ampére being subjected to criticism by 
many people. 

Among the first to do so was Grassman.¹ Reverting to eqn. (4) and employing
ε, the angle between ds and ds′ themselves, in place of ω, the angle between
the planes defined by r and ds and ds′ respectively, Grassman put Ampère’s
formula into the form:

$$F = \frac{i\, i'\, ds\, ds'}{r^2} \left( 2 \cos\varepsilon + 3 \cos\theta \cos\theta' \right) \qquad (6)$$

since

$$\cos\varepsilon = \sin\theta \sin\theta' \cos\omega - \cos\theta \cos\theta'. \qquad (7)$$
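The step from eqn. (4) to eqn. (6) is a single substitution, which may be set out as follows. It uses only eqn. (7) and the sign convention already adopted for θ and θ′ (both measured in the direction of r increasing).

$$
\begin{aligned}
\sin\theta \sin\theta' \cos\omega &= \cos\varepsilon + \cos\theta \cos\theta' && \text{from (7)} \\
\cos\theta \cos\theta' + 2 \sin\theta \sin\theta' \cos\omega
  &= \cos\theta \cos\theta' + 2\left( \cos\varepsilon + \cos\theta \cos\theta' \right) \\
  &= 2 \cos\varepsilon + 3 \cos\theta \cos\theta'
\end{aligned}
$$

The bracket of eqn. (4) is thus identical with the bracket of eqn. (6).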


Grassman then made two points, one of which was valid but inconclusive 
and the other really metaphysical in nature. His first point was that current 
elements, unlike material particles, possess direction as well as magnitude. 
There is thus no a priori reason why the law of action to which they conform 
should be the same as that obeyed by material particles. This is, of course, 
perfectly true and, indeed, would still be true had current elements magnitude 
only since they are certainly not material particles and might well differ from 
them in their behaviour however convenient it might be if they turned out to 
obey the same rules. 

Grassman’s second point was as follows. Consider the case of two current
elements which are parallel so that ε = 0 and θ = π − θ′ (remembering that θ and
θ′ were both measured in the direction of r increasing).

Equation (6) then gives us

$$F = \frac{i\, i'\, ds\, ds'}{r^2} \left( 2 - 3 \cos^2\theta \right). \qquad (8)$$

If now cos² θ = 2/3 or cos 2θ = 1/3 the force F becomes zero. Moreover as
θ passes through the value ½ cos⁻¹(1/3) the force changes from an attraction to a
repulsion. Grassman finds this to be so improbable that he rejects it. He does
not do so on account of any disagreement with observation and, indeed, no
experimental test is possible. His decision is really one of taste. De gustibus
nil disputandum.
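The angle at which Grassman’s objection bites is easily evaluated. The sketch below takes only the angular factor (2 − 3 cos² θ) of eqn. (8), with a positive value read as attraction, and locates the zero at ½ cos⁻¹(1/3); the function name is illustrative.

```python
import math

def ampere_parallel_factor(theta):
    """Angular factor (2 - 3*cos^2(theta)) of eqn. (8) for parallel elements."""
    return 2.0 - 3.0 * math.cos(theta) ** 2

theta_zero = 0.5 * math.acos(1.0 / 3.0)
print(f"force vanishes at theta = {math.degrees(theta_zero):.3f} degrees")

for degrees in (0, 20, 35, 36, 60, 90):
    f = ampere_parallel_factor(math.radians(degrees))
    print(f"theta = {degrees:2d} deg: factor {f:+.4f}",
          "(attraction)" if f > 0 else "(repulsion)")
```

Elements lying nearly end to end repel, elements lying nearly side by side attract, and the change occurs a little beyond 35°.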

Grassman’s difficulty here is that he is not content with a statement that 
electric currents react on each other as if each element of one circuit reacted 
with each element of the other according to a certain rule. In fact, of course, it 
is never possible to say more than this. Science can never be more definite 
about concepts which transcend direct perception by the senses than to say 
that a situation is as if it were composed in a certain way. This applies to the 
structure of matter looked upon as consisting of atoms, electrons etc. just as much 
as to current elements. Grassman was not alone, of course, in his time in 
wanting to know what was the ‘ real’, as opposed to a possible explanation. 

Grassman’s endeavour was to replace the current elements of Ampère,
which it was impossible to subject to experiment, by something else which could
be so tested. To do this he introduced what he called a ‘winkelstrom’, that
is, a current flowing along the arms of an angle. He pointed out that a current


¹ Poggendorf’s Annalen, LXIV, pp. 1-18.




flowing along the sides of the triangle abc (fig. 9) could be looked upon as being 
composed of the three winkelstroms fabd, dbce and ecaf, those arms of the angles 
lying beyond the triangle, such as fa carrying equal and opposite currents from 
two winkelstroms. Any circuit could thus be resolved into its component 
winkelstroms which could therefore replace Ampére’s current elements. 
Grassman does not discuss the closure of his winkelstroms but since he envisages 
the field of a ‘ winkelstrom’ being tested by experiment he clearly is thinking 
of it being completed at infinity. He does not, however, base his conclusions 
on the results of experiments with winkelstroms but uses Ampére’s formula to 
calculate the force they would exert on an element of a second circuit. Since 
in the case of his winkelstroms he is now considering in effect a complete circuit 
it is, perhaps, not altogether surprising that he should find the force to be at 
right angles to the element and in magnitude the same as is given by the formula 
of Biot and Savart. 
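That two different element laws can agree for a complete circuit may be seen numerically. The sketch below writes Ampère’s central force in a modern vector form, −r̂ [2 ds·ds′ − 3 (r̂·ds)(r̂·ds′)] / r², and compares it with the vector-product form ds × (ds′ × r̂) / r² that corresponds to the Biot-Savart formula. Both vector forms, the choice of a circular circuit, and the position of the test element are assumptions made for illustration; currents are set to 1 in electromagnetic units.

```python
import math

def sub(a, b): return (a[0] - b[0], a[1] - b[1], a[2] - b[2])
def add(a, b): return (a[0] + b[0], a[1] + b[1], a[2] + b[2])
def scale(a, k): return (a[0] * k, a[1] * k, a[2] * k)
def dot(a, b): return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]
def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def ampere_force(p, dl, q, dlq):
    """Ampere's central force on element dl at p due to element dlq at q."""
    r = sub(p, q)                       # from the source element to the test element
    dist = math.sqrt(dot(r, r))
    rhat = scale(r, 1.0 / dist)
    factor = (2.0 * dot(dl, dlq) - 3.0 * dot(rhat, dl) * dot(rhat, dlq)) / dist ** 2
    return scale(rhat, -factor)         # positive factor means attraction

def cross_product_force(p, dl, q, dlq):
    """Vector-product (Biot-Savart type) force on dl at p due to dlq at q."""
    r = sub(p, q)
    dist = math.sqrt(dot(r, r))
    rhat = scale(r, 1.0 / dist)
    return scale(cross(dl, cross(dlq, rhat)), 1.0 / dist ** 2)

def total_force(law, p, dl, radius=1.0, segments=2000):
    """Sum the chosen element law round a circular circuit in the xy-plane."""
    total = (0.0, 0.0, 0.0)
    dphi = 2.0 * math.pi / segments
    for n in range(segments):
        phi = (n + 0.5) * dphi
        q = (radius * math.cos(phi), radius * math.sin(phi), 0.0)
        dlq = (-radius * math.sin(phi) * dphi, radius * math.cos(phi) * dphi, 0.0)
        total = add(total, law(p, dl, q, dlq))
    return total

test_point = (2.0, 0.5, 0.7)            # arbitrary position of the test element
test_element = (0.3, -0.2, 0.5)         # arbitrary direction of the test element

f_ampere = total_force(ampere_force, test_point, test_element)
f_cross = total_force(cross_product_force, test_point, test_element)
print("Ampere form        :", f_ampere)
print("vector-product form:", f_cross)
print("component along the element:", dot(f_ampere, test_element))
```

For a single pair of elements the two laws differ (for two parallel elements side by side Ampère’s form gives twice the force), but round the closed circuit the totals coincide, and the resultant is at right angles to the test element as Ampère’s third experiment requires.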


Fig. 9. 

A later questioner of Ampère’s work was Heaviside. In 1888 he wrote:
“It has been stated on no less an authority than that of the great Maxwell,
that Ampère’s law of force between a pair of current elements is the cardinal
formula of electrodynamics. If so, should we not always be using it? Do we
ever use it? Did Maxwell in his treatise? Surely there is some mistake.”¹
Later in 1893 he wrote:² “In fact, in the long series of investigations in electrodynamics
to which Oersted’s discovery and the work of Ampère, Henry and
Faraday gave rise, it was customary to consider an element of a conduction current
as generating a certain field of magnetic force. Natural as this course may have
seemed, it was an unfortunate one, for it left the question of the closure of the
current open; and it is quite easy to see now that this alone constitutes a great

¹ See Whittaker, op. cit., p. 88.
2 Oliver Heaviside, Electromagnetic Theory, Viole ep. O-+. 




hindrance to progress. But so far as closed currents are concerned in a medium
of uniform inductivity, this way of regarding the relation between current and
magnetic force gives equivalent results to those obtained from the first law of
circulation” (i.e. ∮H . ds = 4π times current enclosed) “in the limited form
suitable to the circumstances stated. . . .

But from the Maxwellian point of view this field of H is that corresponding 
to a certain circuital distribution of electric current, of which the current element 
is a part; this complete current being related to the current element in the same 
way as the induction of an elementary magnet is to the intensity of magnetization 
of the latter. Calling the complete system of electric current a rational current 
element it may easily be seen that in a circuital distribution of rational current 
elements the external portion of the current disappears by mutual cancelling 
and there is left only the circuital current made up of the elements in the older 
sense. We may therefore employ the formula” (the Biot-Savart formula) “ to 
calculate, without ambiguity, the magnetic force of any circuital distribution of 
current.” 

Alas, in spite of its provocative christening the ‘ rational’ current element 
proved no more popular than Ampére’s formula. Do we, in fact, ever use it ? 
Nevertheless Heaviside was correct in calling attention to the necessity for 
considering only complete circuits if the Biot-Savart form of the law is to be 
employed and that with an isolated current element of the old type the circuit 
was not closed. If Maxwell’s laws and the idea of the displacement current are 
accepted it follows that an isolated element of steady current cannot exist by 
itself.

    curl H = 4π(i + Ḋ)

Since the divergence of any curl is zero it follows that any distribution of 
moving charges together with their associated displacement currents is non- 
divergent, that is to say continuous. A current element, in other words, can 
never be isolated. There is nothing in this, however, which is incompatible
with Ampére’s conclusions. 
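
The continuity argument can be written out in a few lines. This is only a sketch: it assumes the Maxwellian form of the circuital law with the factor 4π used in the passage quoted from Heaviside, and the symbols i (conduction current density), D (displacement) and ρ (charge density, with div D = ρ) are chosen here for illustration.

```latex
\begin{align*}
\operatorname{curl}\mathbf{H} &= 4\pi\left(\mathbf{i} + \dot{\mathbf{D}}\right)
  && \text{(circuital law with displacement current)} \\
\operatorname{div}\operatorname{curl}\mathbf{H} &= 0
  \;\Longrightarrow\;
  \operatorname{div}\left(\mathbf{i} + \dot{\mathbf{D}}\right) = 0
  && \text{(the total current is non-divergent)} \\
\operatorname{div}\mathbf{i} &= -\frac{\partial}{\partial t}\operatorname{div}\mathbf{D}
  = -\frac{\partial\rho}{\partial t}
  && \text{(assuming } \operatorname{div}\mathbf{D} = \rho\text{)}
\end{align*}
```

Wherever the conduction current of an isolated element begins or ends, charge accumulates and the changing displacement carries the total current onward, so the total current always forms closed tubes.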

In place of Ampère’s expression Heaviside wished to emphasize “the law
expressing the mechanical force on an element of a conductor supporting current 
in any magnetic field—the vector product of current and induction”. He 
wished to attach Ampére’s name to it. This still leaves open the question of 
the generation of the magnetic field by the current which gives rise to it and we 
still have to face the difficulty of the properties to be ascribed to the element of 
that circuit. Heaviside solves this problem by his ‘ rational current element ’ 
which he shows produces a magnetic induction given by the Biot-Savart formula: 
i ds sin θ / r².

More recently Whittaker has continued the criticism of Ampère. He says:1
“The weakness of Ampère’s work evidently lies in the assumption that the
force is directed along the line joining the elements; for in the analogous case 
of the action between two magnetic molecules we know that the force is not 
directed along the line joining the two molecules”. The point is very much 
the same as that raised by Grassman. It must be repeated, however, that there 
is not the slightest reason a priori why the same law should apply in the two cases. 
In fact a variety of expressions can be constructed which when integrated round 


1 Whittaker, History of the Theories of Aether and Electricity, p. 86.


Ampere as a Contemporary Physicist 465 


a circuit agree with experience and it is only in this integrated form that any 
of them can be tested. What Whittaker was pointing out was that Ampére’s 
solution based upon the assumption that the interaction has for its line of action 
the line joining the elements is not the most general one possible. In fact it 
discounts the possibility of there being a turning effect in addition to that tending 
to purely translational motion. In the case of the magnetic molecules the forces 
they each exert on the other are equal and opposite and possess the same line 
of action but this line of action is not generally that joining the molecules. 
They tend to turn as well as to move each other with a motion of pure translation. 

Whittaker rejected the Biot-Savart and Grassman formulae on the grounds 
that they were not in accordance with Newton’s third law. To this extent 
Ampére would agree with him. However, Whittaker also rejected Ampére’s 
solution as well, on the grounds that it is not the most general and developed 
another of his own which included the possibility of a turning effect. On 
integrating round a complete circuit, however, all these formulae yield the same 
result. Ampére would, no doubt, defend himself from this criticism by pointing 
out that no experiment performed so far necessitates anything more than a 
purely translational force to be assumed. His assumption is the simplest to 
be made. It is in accordance with Newton’s laws of motion and with the facts 
of experience. More than this cannot be asked of any theory. 
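
The two claims at issue here, that Ampère’s element law satisfies Newton’s third law while Grassman’s does not, and that both give the same resultant for complete circuits, can be checked numerically. The sketch below is illustrative only. It assumes the modern vector forms in which the two laws are usually quoted (they are not taken from this article), works in units where (μ0/4π) i i′ = 1, and models two coaxial circular circuits as 60-sided polygons. All function names are hypothetical, and Whittaker’s own formula is not included.

```python
import math

# Forces are in units where (mu0 / 4 pi) * I1 * I2 = 1 (an assumed normalisation).

def sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def add(a, b):
    return (a[0] + b[0], a[1] + b[1], a[2] + b[2])

def scale(a, s):
    return (a[0] * s, a[1] * s, a[2] * s)

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def ampere(r1, dl1, r2, dl2):
    """Force on element 1 due to element 2: central, along the joining line."""
    r = sub(r1, r2)
    d = math.sqrt(dot(r, r))
    rh = scale(r, 1.0 / d)
    k = -(2.0 * dot(dl1, dl2) - 3.0 * dot(rh, dl1) * dot(rh, dl2)) / d**2
    return scale(rh, k)

def grassmann(r1, dl1, r2, dl2):
    """Force on element 1 due to element 2: dl1 x (dl2 x r_hat) / r^2."""
    r = sub(r1, r2)
    d = math.sqrt(dot(r, r))
    rh = scale(r, 1.0 / d)
    return scale(cross(dl1, cross(dl2, rh)), 1.0 / d**2)

def ring(radius, z, n):
    """Chord midpoints and chord vectors of an n-sided polygon in the plane z."""
    pts = [(radius * math.cos(2 * math.pi * k / n),
            radius * math.sin(2 * math.pi * k / n), z) for k in range(n)]
    return [(scale(add(pts[k], pts[(k + 1) % n]), 0.5),
             sub(pts[(k + 1) % n], pts[k])) for k in range(n)]

def total_force(law, loop1, loop2):
    """Sum the element-on-element force over all pairs: force on loop1."""
    f = (0.0, 0.0, 0.0)
    for r1, dl1 in loop1:
        for r2, dl2 in loop2:
            f = add(f, law(r1, dl1, r2, dl2))
    return f

def third_law_residual(law, e1, e2):
    """F(1 due to 2) + F(2 due to 1); zero when action equals reaction."""
    (r1, dl1), (r2, dl2) = e1, e2
    return add(law(r1, dl1, r2, dl2), law(r2, dl2, r1, dl1))

if __name__ == "__main__":
    # One arbitrary pair of elements: (position, element vector).
    e1 = ((1.0, 0.0, 0.0), (0.0, 0.01, 0.0))
    e2 = ((0.0, 0.0, 0.0), (0.01, 0.01, 0.0))
    print("Ampere residual   :", third_law_residual(ampere, e1, e2))
    print("Grassmann residual:", third_law_residual(grassmann, e1, e2))
    # Two coaxial circuits carrying current in the same sense.
    upper, lower = ring(1.0, 0.5, 60), ring(1.0, 0.0, 60)
    print("Ampere, closed    :", total_force(ampere, upper, lower))
    print("Grassmann, closed :", total_force(grassmann, upper, lower))
```

For the single pair of elements the Ampère residual vanishes while the Grassman residual does not. For the two complete circuits the resultants agree to within the discretization error, which is the sense in which no experiment on closed circuits can decide between the two laws.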

Ampére, in fact, devoted considerable space to the consideration of the 
difference between a current element and a magnetic molecule. Much of what 
he wrote, it is true, is concerned with a discussion of the circumstances under 
which the force between two bodies can be considered ‘ truly elementary ’ and 
this being a matter of taste and definition, was not very profitable. Nevertheless, 
in elaborating his theory of magnetism he showed very clearly that his law of 
action was capable of explaining quantitatively the action of electric currents on 
magnets and the action of one magnet on another, covering in so doing the case 
cited by Whittaker of the interaction of two magnetic molecules. 

An objection of a different kind has recently been made by Geoffrey Builder.1
He begins by mentioning Whittaker’s criticisms of Ampére discussed above 
and also his rejection of the Grassman formula on the grounds that it is inconsistent
with Newton’s third law. Builder then claims to show that “the
Grassman formula is correct and that the Ampère formula is untenable because
it is incompatible with the restricted theory of relativity and with the accepted
electrodynamical theory of electrons”.

Although his theory is interesting and instructive it does not seem to me that
he has substantiated his claim. He starts with the Coulomb force between two
charges q and q′, q being at the origin of coordinates and q′ at a distant point
moving with velocity v′. The charge q is then given a velocity v and the resulting
force between q and q′ calculated by means of the transformation formulae of
relativistic mechanics. This is then applied to the moving charges in a current
element. Builder says: “A current element i ds, of a metallic conducting
circuit carrying a current i may be represented by n pairs of equal and opposite
point charges q/n and −q/n moving with colinear speeds u and v respectively
in the direction of the unit vector t tangent to the current element.” It is here
that the difficulty arises.


1 Geoffrey Builder, Bulletin of the Institute of Physics, vol. 9, 12, 1958. 




A current element in an electric circuit carrying a steady current is in a state 
of dynamic equilibrium. It is connected to a source of electric charge at one 
end and to a sink at the other. This is no longer true in Builder’s model which is
equivalent not only to a flow of electric charge but involves a changing electric 
dipole in addition, produced as his charges separate. The properties of a body 
may be profoundly modified by its connection to others. My head, for example, 
is a very different thing (or so I would hope) in its normal position attached to a 
living body than it would be if severed from it, or to give a non-biological 
example, I can sit in a railway carriage until I perish from cold or die of starvation 
but I shall never get to London if it does not form part of a train. When 
Builder’s current elements are connected together to form a circuit the changing 
dipole moments will cancel each other out. We might therefore expect that the 
law which he deduces as a result of his calculations will apply to completed 
circuits but will tell us nothing definite about the action of the individual 
elements. He has, in fact, provided us with another alternative derivation of the 
Biot-Savart formula subject to the same limitations as the previous ones. It is,
of course, well known that his formula gives the correct result for a complete
circuit. It in no way invalidates that of Ampére, however. 

Rosser has recently given in greater detail in this Journal much the same 
calculation as Builder, with the suggestion that a deduction of the Biot-Savart
formula from the principles of electrostatics and the theory of relativity furnishes 
the best ground we have for accepting it. It is desirable to examine this 
suggestion. It amounts to accepting the Biot-Savart formula on the grounds 
that it occurs in an axiomatic deductive system and the force of the argument 
will depend upon what other independent evidence there is for the acceptance 
of this system. Any axiomatic system depends for its acceptability upon the 
direct evidence which exists for each of its deductions and the object of building 
such a system is to allow the direct evidence for any one deduction to serve as 
indirect evidence for all the others. If the law of Biot and Savart is included in 
such a system for which there is other independent evidence, then clearly it 
gains in strength and Rosser’s calculations contribute to its acceptability. It is 
equally clear, however, that since the acceptability of the system as a whole 
depends upon the direct evidence for each of the constituent deductions, the 
calculation cannot by itself entirely displace the basis of the law in experiment. 
Moreover since the mechanics of moving charges furnishes a very important 
part of the evidence supporting the special theory of relativity, without which 
the theory would be in a very much less assured position than it is, Rosser’s 
argument cannot be pushed very far. We cannot use the mechanics of moving 
charges to establish the theory of relativity and then use the theory of relativity 
to establish that of mechanics. Furthermore to do what is required the 
deduction would have to establish the law to an accuracy of something like 
1 in 10°—the accuracy of the current balance. On the face of it it seems unlikely 
that a theory dependent upon the approximate evaluation of second order effects 
could achieve this. 

Moving packets of charge, such as those contemplated by Builder and Rosser,
do not produce steady fields. Associated with the fields which they carry with
them is [illegible] which is propagated through space. This introduces
[illegible] into the considerations. If energy and momentum are transferred
to and from these fields the forces which two such packets experience due
to each other need certainly not be equal and opposite. The difficulty, if difficulty
it be, disappears under steady conditions. This kind of explanation was considered
by Ampère when he discussed the possibility of forces arising through
a reaction from the ether. His conclusion was, “Sans qu’on soit autorisé à
rejeter les explications fondées sur la réaction de l’éther mis en mouvement par
les courants électriques, rien n’oblige jusqu’à présent d’y avoir recours” (that is,
while we are not entitled to reject explanations founded on the reaction of the
ether set in motion by electric currents, nothing so far obliges us to have
recourse to them).

So long as complete circuits are being considered it is only a matter of 
convenience whether Ampére’s formula or that of Biot and Savart is employed. 
The latter is almost always simpler to use and, as Heaviside remarked, Ampére’s 
is not employed in practice. It is only when it is desired to determine the forces 
on a part of a circuit produced by the current in the remainder of the circuit, 
as would be the case in the simple circuit breaker illustrated in fig. 10, that 
difficulties arise. These are of two kinds. The first is that of the equality of 
action and reaction and with Ampére’s formula this does not arise. According 
to the Biot-Savart formula the force on a current element will always be at right 
angles to it whether it is a complete circuit or only part of a circuit which gives 
rise to the force. 


Fig. 10. An idealized circuit breaker. The square wire frame ABCD is divided by mercury
cups at D and C. The upper portion DABC is counterpoised on a balance. Repulsion
between it and the fixed member DC will cause it to rise when a current i
is made to flow round the square (depending upon the weight of the counterpoise).
Connection to the electric supply has been omitted to simplify the diagram.


If we consider according to Biot-Savart, the force on the side DC of the 
square circuit breaker in fig. 10 (CD is fixed but experiences a force from the 
remainder of the circuit which is counterbalanced by the reaction of its supports) 
it can be looked upon as being composed of three parts, F_AD due to the current
in AD, F_AB due to the current in AB and F_BC due to the current in BC. If
now we consider the vertical forces exerted on the other and movable part of 


3 Cbg {Os SO. 
CoP, Wey 




the circuit DABC, they will, of course, all be exerted on the member AB, the 
forces on DA and BC being horizontal. Since a square circuit, if floated on a
liquid, does not tend to move, the forces on the fixed and movable portions of the
square must be equal and opposite. The force on AB produced by the fixed
side DC, F_DC, will obviously, by symmetry, be equal and opposite to F_AB,
the force exerted by AB on DC. Hence the force experienced by AB will only
be equal and opposite to that experienced by CD if we include forces F_AD and
F_BC caused by the sides AD and BC respectively. In other words to calculate the
forces on the movable part of the square circuit it is necessary to include forces 
which parts of it exert on itself. If these ‘self forces’ are included the Biot-Savart
formula can be applied in this case also. We have, as I have said before,
to invent a new mechanics, and that is not, as Rosser suggests, the mechanics 
of relativity. 

The second difficulty applies equally to Ampére’s as to the formula of Biot 
and Savart. It is that if the circuit be assumed to be a line then the integrals 
obtained in calculating the forces exerted by one part of a circuit on the remainder 
become infinite in both cases. The difficulty can be overcome, at least in simple 
cases, by a cumbersome further subdivision of the current elements into sub- 
elements but in practice, neither formula would be used. This difficulty is 
similar to that which occurs in the calculation of coefficients of self induction. 
It is dealt with in the standard method for calculating these coefficients and the 
forces required would be calculated from them rather than by employing either 
Ampère’s or Biot and Savart’s formulae.

The fact that this discussion of the relative merits of Ampére’s formula and 
that of Biot and Savart has lasted for 150 years without reaching any fixed 
conclusion seems to me to illustrate the importance in the scientific field of 
something like a verifiability criterion of meaning. I do not wish to digress 
upon logical positivism or to debate the case for it in other fields than science; 
but in the field of science, any question to which no experimental observation 
can be germane—as in the case of the rival merits of the formulae of Ampére 
and of Biot and Savart—can clearly never be answered finally and moreover 
cannot affect any practical decision which has to be taken. There is, of course, 
a sense in which Ampére’s formula is superior to that of Biot and Savart and 
that is that it is consistent with Newton’s third law whereas Biot and Savart’s 
is not. We are, in practice, so accustomed to the applications of Newton’s laws
that to work with expressions which transgress them makes our thinking very 
liable to error. But there is equally a sense in which the formula of Biot and 
Savart is superior to that of Ampére. It is simpler. We are fundamentally 
lazy—perhaps it would be more becoming to say that we are highly appreciative 
of mathematical succinctness—and choose the easiest path. But we have to 
be careful that we do not apply the formula out of context. 


The Author: 


Is one of Her Majesty’s Inspectors of Schools and holds the post of Staff Inspector 
for Science. He took the Natural Science Tripos at Cambridge in 1923 and obtained a 
Ph.D. for work on the relativistic variation of the mass of the electron in 1926. 


The Royal Institution Christmas Lectures, 1961 


Last November I decided that I would take a grandson, aged 16 years, and 
a granddaughter, aged 12 years, to the Christmas Lectures at the Royal Institution 
—the one hundred and thirty-second course of six Lectures Adapted to a Juvenile 
(10-17 years) Auditory. The subject of the lectures was Electricity, and they 
were delivered by Sir Lawrence Bragg, Director of the Davy Faraday Laboratory 
of the Royal Institution. 

I went to discover to what extent these lectures would help teachers to solve 
the problem posed in the following: 


“. . . school physics courses fail to arouse any real interest in or understanding
of science among those boys and girls who do not continue a specialist education in
science beyond the normal school leaving age.”

“Any criterion of successful teaching . . . must take into account . . . the extent to
which a boy leaving school at sixteen to be articled to a solicitor carries with him any
interest in science and respect for those who practise it professionally.”


In the first lecture Sir Lawrence brought life to the dry bones of traditional 
electrostatics. The use of perspex in place of ebonite and glass made the simple 
attraction and induction experiments behave more regularly and convincingly 


Fig. 1. Van de Graaff machine.


470 V. T. Saunders 


and the modern fibres enabled us to see the electrostatic lines of force radiating 
from the sphere of the miniature van de Graaff machine (fig. 1) which was fed 
at 104 volts and whose sphere was at approximately 10° volts. 


Fig. 2. The spreading of a powder. 


A doll’s house under an artificial thunder cloud was unscathed when protected 
by a lightning conductor, but when the protector was removed the lightning 
shattered the roof. 

That the study of electrostatics satisfies the Baconian demand that science 
must minister to the needs of man was shown by the spraying of plastic (fig. 2) 
and the spreading of weed-killer, by the carrying of the powders to the required 
places along the lines of force of an electric field. And a demonstration of the 


Fig. 3. Representation of the crystal lattice of germanium. 




suppression of the discharge of soot from a model factory chimney, by the setting
up of an electric field between a central conductor and the walls of the chimney,
was of topical interest in view of the powers vested in local government authorities
by the Clean Air Act.

The next lecture brought to us frogs’ legs which really twitched, Cruickshank’s 
battery lighting a Christmas tree, the effect of putting a piece of wire wool (from 
the kitchen) between the brass strip terminals of a 4.5 volt torch battery, the
electrical conductivity of London tap-water and that of a flame. There was also


Fig. 4. Discharge by a large transformer. 


a vivid demonstration of the photoelectric effect including the quenching of the 
efficacy of an arc lamp by the interposition of a sheet of glass. We also had a 
glance at a semiconductor whose conductivity increased as its temperature rose, 
and were introduced to the germanium crystal lattice (fig. 3) and the associated 
diagrams showing the movements of holes and electrons in a p-type and n-type 
semiconductor. And so in the second lecture we had travelled from frogs’ legs 
to transistors! 

Through the Oersted and Faraday work we were led to electric machines 
and then to electrical oscillations and electromagnetic waves. There was an 
impressive demonstration of the output of a large transformer (fig. 4) and of the 
lighting of a discharge tube by a Tesla coil (fig. 5). 

In the course of these demonstrations the lecturer mentioned Clerk Maxwell,
and I, who have spent many years trying to teach electromagnetic induction at
O and A levels, held my breath: what can be said about Clerk Maxwell to some




400 boys and girls ranging in age from 10 to 17 years? In about forty words
the divs and curls had vanished and the equations had been dealt with qualita- 
tively with such skill and assurance that not one of the Auditory showed signs of 
realizing that so much ground had been covered in so few words. 

Resonance and ways of maintaining oscillations were illustrated; and in par- 
ticular I remember a simple demonstration showing the grid of a triode being 
biased by induction from a charged perspex rod. 


Fig. 5. (a) Tesla coil. (b) Tesla coil lighting a discharge tube.


Figure 6 shows an aluminium ball on a glass saucer resting on an iron ring 
wound to be fed with three-phase a.c. When the circuit was closed the ensuing
motion of the ball round the saucer brought a burst of applause from all over
the theatre. The magnetron was introduced to us in connection with 3 cm
electromagnetic waves which we saw focused by a glass lens and reflected by a
piece of tin plate. A radio pill(2), with a pressure transducer, was swallowed by
Mr. Coates who, when poked in the proper place, gave the expected response. 
In the final lecture we saw some of the latest applications of radar, including the 
following of rain clouds across the sky. Finally the lecturer referred to the 
exchange of electric power between England and France and reminded us that 
this will take place using direct current. 

Now that the course is completed I must consider its impact. First, what were 
the reactions of my young companions, neither of whom is scientifically minded
but both of whom are in the grammar school stream? Both looked forward
eagerly to ‘the next lecture’. The 16-year-old boy sat for most of the time with 
his chin in his cupped hands, elbows on his knees and his eyes fixed on the 
lecturer; I suspect that he understood little of the scientific detail and argument 
but he was manifestly fascinated by this new world. The 12-year-old girl was
apprehensive about bangs and flashes but was clearly determined to see this thing
through.


Fig. 6. Ball spinning under the action of 3-phase A.C. 


After each lecture the Auditory were invited to come to the demonstration 
benches to examine the equipment and to ask questions. As I moved amongst 
them their remarks and questions proved to me that the flow of young people 
endowed with the enquiring mind shows no sign of failing. 

Schoolchildren living in London and in the Home Counties are indeed 
educationally privileged not only in connection with the Christmas Lectures 
but also by being able to attend the Afternoon Lectures for Sixth Form Boys and 
Girls which have recently been delivered at the Royal Institution. 


My conclusions are (a) that Sir Lawrence Bragg has shown how the bridge
over the gap between the scientist and the intelligent non-scientist can be
established, and (b) that it is possible to introduce modern physics, without
misleading sacrifice of accuracy, to school boys and girls.

I must thank Sir Lawrence Bragg for allowing me to use the subject matter of 
his lectures for this article. I am also most grateful to Mr. W. A. Coates, Senior 
Experimental Officer at the Royal Institution, for setting up the apparatus, after 
the period of the lectures, for the express purpose of taking the photographs. 


REFERENCES 


(1) N. Clarke, Bulletin of the Institute of Physics and The Physical Society, January 1962.


(2) Contemporary Physics, August 1960, 489. 
V. T. SAUNDERS.


Sciences in Communist China 


by K. MENDELSSOHN 


One of the major historical puzzles is the failure of the great civilizations of the East 
to follow Europe’s lead in developing a scientific technology three hundred years ago. In 
the middle of the seventeenth century it would have been theirs for the asking. There is 
little reason to blame lack of natural ability. Yukawa, Yang and Lee, to mention only a few 
names, have shown that the Eastern mind is capable of making fundamental contributions, 
and recent technological advances in Japan make it clear that they are not mere imitators. 
Possibly a curious historical accident may be responsible. In both Japan and China, 
the time of the first contacts with the West was marked by the emergence of new regimes 
which were strict adherents to old traditions because they did not have any of their own. 
In Japan the Tokugawa shoguns had, after a long civil war, united the country under their 
bakufu, the ‘ camp government ’, which was based on the feudal samurai traditions and they 
sealed off their country with the most meticulous cruelty against all external influence which 
might introduce new ideas. In China the semi-barbaric Manchu invaders who had placed
themselves on the dragon throne were equally anxious to perpetuate unchanged the living 
conditions under the Ming dynasty which they had replaced. It is significant that both 
the shoguns and the Manchu permitted a limited study of Western development but this 
was kept strictly for their own information and not allowed to come into contact with the 
population of their countries. The mission which the Jesuits were allowed to keep in 
Peking for over two centuries was confined to the precincts of the Imperial City and while 
the emperor Chien Lung showed great personal interest in their astronomical knowledge 
and clockmaking skill, he certainly did not let the Fathers disseminate any Western ideas 
in his country. 

When less than a hundred years ago the Tokugawa were ousted and Japan was opened 
to the West, nobody foresaw the meteoric rise which in a few decades completely altered the 
balance of world power. Modern technology in the capable hands of an Asiatic nation 
led in an astonishingly short time to Pearl Harbour. The Western powers were not going 
to repeat this mistake in China and, instead of pressing for reforms, they did their best to 
prop up the tottering Manchu Court against the Chinese progressives. There was no sign 
of rapid progress in science and technology even after the Manchu abdication of 1911 and 
China lingered on, helpless against her own warlords and the Japanese invader. Then 
after the cataclysm of the second world war a change took place which was as spectacular 
as the rise of Japan. However, China’s new alignment with the Soviet Union withdrew 
much of this development from the sight of Western observers and particularly from the 
Americans. 

It is against this background that we must view the mighty tome of nearly 900 pages which 
the American Association for the Advancement of Science is presenting.† In order to
provide some information on the rapid progress of science in China, the U.S. National 
Science Foundation suggested in April 1960 that a meeting should be held under the auspices 
of the AAAS where papers on the progress of science in China should be presented. This 
symposium was held on December 26/27 of the same year in New York and the organizers 
must be congratulated on their efforts of bringing out this very topical material only six 
months later in book form. The speed of publication is perhaps an indication also of the 
urgency with which the technological rise of China is considered in the Western world. 

Another significant aspect of the symposium is that only one of the 29 contributors has 
visited Communist China while the papers presented by the others are based on a survey of 
publications. There are altogether five sections, dealing respectively with science and 
society, biology and medicine, geophysics, physics and mathematics and finally engineering. 
The surveys have clearly been prepared with much care and, apart from the first section, 
the emerging picture is remarkably consistent. Science in China is advancing rapidly and 
expanding at a great rate with the limited number of qualified scientists as the most serious 
bottleneck. There is also a shortage of equipment but one gets the impression that this 
deficiency can be made good more rapidly than the shortage of scientific manpower. 


Scientists in China seem to receive every support and encouragement in their work by the 
authorities. 




To the readers of this journal the field of physics will be of prime interest but it must be 
reviewed with due reference to the progress made in other branches. China is only beginning 
to emerge from a state of extreme technological backwardness and from long lasting civil 
disorder. This is a survey of only the first decade of the new government and the problems
of food, health and communications as well as those of the basic technologies necessarily 
overshadow the demands of pure science. Agriculture and medicine in particular claim 
the attention of the Chinese in the present phase of the development of their country. 
In the article on public health the accomplishments in the prevention and control of disease 
are termed as ‘startling’. There is also a short but most interesting section on the attitude 
towards traditional Chinese medicine which, after being treated by Western doctors as 
ridiculous mumbo-jumbo, is again receiving considerable encouragement. The writer of this 
review was impressed by the throng of patients attending an old-style Chinese pharmacy in 
Peking. The tendency of the authorities is clearly to arrive at a new approach to medicine
in which traditional methods and remedies take their place side by side with the Western 
concepts of clinical practice. 

It is clear that at the present stage of rapid agricultural and industrial development in 
China, survey and study of the physical conditions of this vast country are of prime 
importance. The articles dealing with geography, geology and meteorology all stress the 
concentration of effort in these fields, the increase in published papers, the application of 
new methods and the training of very great numbers of scientists. Control of the available 
water supplies has always been foremost in the mind of the Chinese and, after a decline 
of the irrigation system which set in at the end of the Manchu dynasty and was hastened 
by the chaotic conditions following it, water resources are now investigated and harnessed 
with scientific methods and on a nation-wide scale. 

No research in physics worth mentioning had been carried out in China when the 
Communist Government took over in 1949. Since then a very considerable effort has been 
made but here again emphasis is being laid on subjects which lead to application rather 
than on fundamental problems. In the nuclear field two accelerators and a heavy water 
reactor are in operation and it is pointed out that a good deal of work is being done by 
Chinese scientists working at the Nuclear Research Institute at Dubna near Moscow. 
In the second half of 1960 a third of the papers published from this laboratory had Chinese 
co-authors. Here, as in most other fields, the Russians have given much help in training, 
equipment and in advice. The present writer noticed an appreciable number of Chinese 
students during a visit to Moscow University and his Russian colleagues commented most 
favourably on the quality and industry of these students. Equally, during a visit to Chinese 
centres of training, the Chinese scientists told him of the generous help they had received 
from their Soviet colleagues. 

Work in solid state physics, too, is being started at a great number of centres in China 
and here the main interest seems to be in the investigation of metals. There is, as yet, not
much in the way of published research, at least in the experimental field, while papers on 
theoretical solid state physics show that the Chinese are quite capable of handling problems 
of current interest. 

Emphasis on problems of practical utility is also the keynote of the report on chemistry. 
Here again it is stated that the progress in the last ten years has been spectacular but that so 
much had to be done that, with the pressing need of applications, the standard of chemical 
research is so far well below that of the West. Much the same is true for all branches of 
engineering.

Altogether the symposium report tells a story of impressive advances in Chinese science 
and engineering on an extremely broad front. Nowhere has this front as yet reached the 
area of current research in the West or in the Soviet Union but, on the other hand, it seems 
equally clear that the rate of advance is astonishingly rapid and that in the foreseeable future, 
in some fields, Chinese science may catch up with the rest of the world. It also has to be 
remembered that progress in China will be supported eventually by very large numbers. 
While at present the manpower problem in science and technology is one of extreme 
difficulty, China may ultimately command a reservoir of scientific manpower which can 
outnumber any other nation. The report leaves little doubt as to the strong and generous 
backing which science and technology in China are given by government and party. Most 
authors who discuss possible disadvantages of state direction conclude that these are more
than balanced by the financial and moral state support.




There is a curious contrast between the conclusions of the reports on science and tech- 
nology as such and those of the first section dealing with science and society. Almost all 
the authors here paint a gloomy picture of scientists enslaved by state direction and without 
freedom in choosing their field of work. While the details given on the organization of 
research and teaching will be found valuable, the reviewer, for one, must hesitate to subscribe 
to the opinions expressed otherwise in this section. After all, the reports had to be based 
on Chinese published papers and articles and these will necessarily fail to convey fully the 
spirit in which they were written. The one great shortcoming shared to some extent by 
almost all the contributors to the symposium is their remoteness from the conditions in 
China after 1949. They had to rely entirely on the printed word which cannot convey the 
feeling of enthusiasm and the spirit of dedication to the re-birth of a great and ancient 
civilization which are the mainsprings in the work of most scientists working in China today. 
Those who have never set foot in a Communist country usually find it quite impossible to 
gauge the effect which a society so different from ours has on the individual living in it. 
While the daily life and the position of a scientist there can be very different from ours, this 
does not mean that he and his work are necessarily affected in a disadvantageous manner. 
We are too much addicted to thinking that methods radically different from our own must be
inferior and must lead to a lower efficiency. The appearance of the Soviet satellites and the
sustained Russian lead in space exploration should have taught us a lesson. 

In spite of their lack of direct contact, the authors of the scientific reports of the 
symposium have in many instances sensed the emotional force so characteristic of the new
China, and they are to be congratulated for having recorded their impression in the articles. 
They have done a service to their colleagues in preventing them from underestimating the 
impetus of the new growth of science and technology in China. 

Any report on China today will have to be read with a mind which is open as well as 
critical and Sciences in Communist China is no exception. This will not detract from its 
immense value as a compendium of information and as a guide to source material. Anybody 
in the West who seriously wishes to form an opinion on the development of Chinese science 
and technology at the present time should study the reports of this symposium. 


† Sciences in Communist China. Edited by SIDNEY H. GOULD. The American Association
for the Advancement of Science, Washington, D.C. (Bailey Bros. and Swinfen, Ltd). (Pp. xii + ...)


ESSAY REVIEW 


Cosmic Rays. Handbuch der Physik (Encyclopaedia of Physics). Edited by S. FLÜGGE.
Volume 46, Part 1, Cosmic Rays I. 


The subject ‘ Cosmic Rays’ has produced no really outstanding general survey under 
individual authorship. In this respect it has much in common with other branches of 
physics strongly dependent on material outside the control of the experimenter, insofar as the
growth of understanding cannot be channelled through logical schemes from the simple 
to the complex. Moreover, as has become very clear at the present time, hardly any signifi- 
cant feature of the study of cosmic rays follows a straightforward application of principles 
otherwise well-known. 

The emergence of a reasonable pattern of cosmic ray studies may be traced back to the 
decade 1930-1940, with the recognition that the primaries incident on the earth were 
predominantly positively charged while the particles observed deep in the atmosphere 
scarcely showed this property but proved to be mostly of a novel type (muons) and unstable, 
with so short a life that the possibility that they might form any part of the primary incident 
radiation seemed entirely excluded. From this time the development of the subject has 
been along lines which are very much divergent: on the one hand the problems posed by the
very existence and by the more detailed behaviour of the primary radiation, and on the other 
the problems of nuclear physics and of electromagnetic interactions which arise in the trans- 
formation of the primary beam incident at the top of the atmosphere into the complex 
secondary components which may be studied through the atmosphere and at ground level. 

Under these circumstances, it is not surprising that the main literature has come in 
collections of specialized articles, such as the present volume of the Handbuch, and in mono- 
graphs on very restricted topics of which Arley’s ‘ Stochastic Processes ’ is an early example 
and to which Dorman’s Вариации космических лучей (Cosmic Ray Variations) and
Galbraith’s ‘ Extensive Air Showers ’ are notable and recent additions. 

It is perhaps natural to ask whether, in this situation, the subject ‘ Cosmic Ray Physics ’ 
has enough coherence to justify the intention, clearly evident in this volume, of treating
the whole subject in an interrelated manner. The answer must be that, while in very many
respects the two aspects of the subject are in fact now all but separated, there remain one 
or two features of great importance in which they remain linked and seem likely to continue 
so until these have been studied, and understood, in far greater detail than is possible at 
present. 

One of the striking contributions to the physics of cosmic ray studies up to about 1950 
was the identification among cosmic ray secondaries of pions, muons and of the strange 
particles. The very success of this work ensured that great accelerators could be developed 
in the confidence that exciting and important work awaited them. These machines have 
been more than able to reach the thresholds of all reactions which were suspected in 
any detail as a result of cosmic ray observations, and the advantage which remains with 
natural material in the greater energies which may be explored has not on the whole been 
fruitful in the greater part of straight particle physics. 

At the other extreme, the general composition of primary cosmic rays cannot really be 
studied at all until heights in the atmosphere are reached where the residual air mass over 
the apparatus is of thickness small compared with that in which the primary is likely to 
make its first encounter with terrestrial matter. This single fact makes the separation
effective. But over and above this, it now seems probable that only under exceptional
conditions can the very abundant, low energy, cosmic ray primaries actually reach the earth, 
and to study these particles, and the conditions in the space around the earth which exercise 
so strong a control upon their movement, the use of satellites and of space probes is the only 
effective mode of approach. 

The most notable link between these extremes at the present time concerns the 
primaries of exceptionally great energy. On the cosmological side these are of particular 
significance because they are so little affected by local and varying conditions in the im- 
mediate neighbourhood of the solar system. But on the scale set by human technology, 
both in its capabilities and in the space of time which we have come to think of as appropriate 
to the resolution of problems in physical science, those particles of great energy are
unsatisfactorily rare. For example, primaries of energy of the order 10^18 eV or greater (which
is a significant value of energy in this context) may be expected to fall on an area of one
square metre exposed near the earth about once in 3000 years. Nothing useful can be done 
unless the reception area is at least four orders of magnitude greater, and it is likely to be 
a long time before such detectors can be established beyond the atmosphere. 
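
The arithmetic behind this estimate can be sketched as follows. This is only an illustration of the figures quoted above (about one primary per square metre in 3000 years); the function and constant names are hypothetical and the calculation assumes the rate scales simply with collecting area.

```python
# Figure quoted in the review: roughly one primary of energy of order 10^18 eV
# or greater per square metre every 3000 years (an order-of-magnitude value).
YEARS_PER_EVENT_PER_M2 = 3000.0


def events_per_year(area_m2: float) -> float:
    """Expected number of such primaries per year over a collecting area in m^2."""
    return area_m2 / YEARS_PER_EVENT_PER_M2


def years_per_event(area_m2: float) -> float:
    """Mean waiting time in years between such primaries for the given area."""
    return YEARS_PER_EVENT_PER_M2 / area_m2


if __name__ == "__main__":
    # Compare one square metre with areas two, four and six orders of magnitude larger.
    for exponent in range(0, 7, 2):
        area = 10.0 ** exponent
        print(f"area = 1e{exponent} m^2: {events_per_year(area):.3g} events per year")
```

On these assumptions an area four orders of magnitude greater than one square metre collects only about three such primaries a year, which is consistent with the reviewer treating that area as a minimum.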

Our own earth, however, is remarkably suited to an indirect detection of these primaries. 
The successive processes which are initiated in the atmosphere by the entrance of such a 
primary lead to a ‘ shower’ of a great number of secondary particles spread widely but
fairly evenly over a considerable area. This development is near its climax at sea level,
when perhaps 10^8 secondaries are spread at detectable densities over an area of at least 10
square metres. Useful samples of such a shower may be taken using relatively few detec- 
tors of moderate size: these effectively scan the whole of such an area and have allowed 
such primaries to be detected at a useful rate. Devices of this kind are in operation in 
many parts of the world.

However, what is observed is remote, through a complex sequence of interactions, from 
the real object of our interest, the primary particle. We have as yet only rough ideas as to 
how the magnitude of such a shower is related to the nature and energy of the incident 
primary particle, and, what is perhaps even more important, as to the fluctuations which 
might be expected in such a relationship if we were able to establish it. Here, then, one 
of the most significant of the cosmological features of cosmic rays is inextricably linked with 
the process of observation and so with the nuclear physics at extreme energies through 
which the observation is made. This is an example of a point where separation of the 
divergent aspects of the subject cannot be expected in the near future. 

A collection of specialized contributions is likely to be of one of two kinds. It may be 
of the ‘ Progress’ type, now familiar in many fields of physics. Here the emphasis lies on 
the changing and developing aspects of the subject; there is a good deal of pressure to 
select articles for their immediate timeliness, and the same topic may be set out, as it actually 
develops, in successive volumes. The Handbuch der Physik requires a rather different
approach; the articles are less likely to be supplemented as new material becomes available,
and accordingly, they ought, as far as a fast-developing subject permits, to possess a quality 
of enduring usefulness. From this aspect, the editor and those who have written for him 
have in this book achieved notable success. If the second part of the volume, to which 
occasional unexplained and tantalising references are made in the text, maintains the same 
standard, the whole promises to be an outstanding section of this great work. 

The present part opens with a chapter on the ‘ Origin of Cosmic Rays’ by Philip 
Morrison, and here the choice of material, emphasis and presentation succeed brilliantly. 
Much of the evidence is assumed known, and it may be that anyone quite unfamiliar with 
the subject will find difficulties. But this is a serious attempt, as is essential in the ‘ Hand- 
buch’, to bring out the broad principles about which one is reasonably certain. In his 
closing lines, Morrison expresses the hope ‘ that the model here outlined will at least be 
recognisable in the future development of our knowledge ’, and indeed changes must prove 
far more radical than any of us now suspect if this hope is to be disappointed. 

The names of Störmer, Lemaître and Vallarta carry the credit for a great part of the
development of a manageable treatment of the motion of charged particles in a magnetic 
dipole field as this relates to particle trajectories near to the earth and reaching it from 
infinity, and the editor has been fortunate in that Professor Vallarta has provided the second 
chapter, entitled ‘ Theory of the Geomagnetic Effects of Cosmic Radiation’. This survey 
of the treatment of trajectories in the geomagnetic field is developed with admirable clarity, 
but it is essentially a formal treatment in summary, and it cannot in any way make easy 
reading. The notes concerning the problem set by the non-dipole elements of the field
are extremely compressed, but these also give a useful indication of the scope of more recent 
developments in this field. 

In writing on ‘ Experimental Results of Flights in the Stratosphere ’, Ernest C. Ray has 
been given an assignment which in the context of the Handbuch is particularly intractable.
It is understandable that the editor should wish to have assembled some account
of the many types of observation that fit this title, and those who have followed the literature 
will find considerable pleasure in this well-digested summary. But in spite of its merits, 
and perhaps because, with its latest references to about 1957, it ceases in the midst of
important high-altitude work, this article does not seem to have that element of permanence
which would have been wished for here.




Kurt Sitte has produced a notable survey of high-energy nuclear interactions (going 
far beyond the machine range) under a title, ‘ Penetrating Showers ’, which, in spite of its
historical associations, presumably is used here because of the difficulty in thinking of any 
comparably brief but more relevant one. This excellent treatment is particularly readable, 
and succeeds in giving some of the atmosphere in which the interpretation of widely differing 
kinds of data has been coordinated. It offers, moreover, the starting point for some of 
the considerations of shower development in the atmosphere which were referred to in 
earlier sections of this review. 

The following chapter, by Giuseppe Cocconi on ‘ Extensive Air Showers’, stands
worthily beside Sitte’s. It shows comparable clarity, and, since a formal treatment of
cascade theory seems to be intended in the second part of the volume, is able to avoid
over-emphasis of the electromagnetic cascade as a shower component. The ‘ Selected Topics ’
occupying the last fifteen pages of Cocconi’s article form a most pleasing feature and give 
a valuable glimpse of the significance of deductions about the primary radiation from studies 
of the larger showers. 

The final chapter, by Fowler and Wolfendale, on the ‘ Hard Component of μ-mesons
in the Atmosphere ’, develops from a critical survey of knowledge about the vertical muon 
spectrum at sea-level, which it illuminatingly describes as one of the ‘fundamental constants’ 
of cosmic ray physics. It is a pity that the time of writing has just excluded the important 
extensions of our knowledge of this constant which have come from Dr. Wolfendale’s 
own group. Meteorological effects on the atmosphere muon beam are referred to only 
very briefly, with some hint that they may be treated more substantially elsewhere (pre- 
sumably in the second part of the volume). 

The overall cover of the subject cannot well be examined until the remainder of the 
volume appears. What is already done, however, is quite admirable, partly because the 
treatment appropriate to this series has so often been gauged with conspicuous success and 
partly because the authors have done this in so readable a manner. The production of
the book is of the expected high standard, although a few errors have been allowed to pass 
(there is trouble with the numbering of Tables in Morrison’s article). It is perhaps a sign 
of the times that all of the articles are in English! J. G. WILSON.


BOOK REVIEWS 


Protection of Structural Steel Against Fire. By G. I. BIRD. Fire Note No. 2. (Published
for DSIR by HMSO.) 2s., by post 2s. 4d. U.S.A. 36 cents. 


Structural steel elements, though themselves of course incombustible, will not perform 
the functions for which they were designed when exposed to fire unless protected. This 
pamphlet describes the ‘ solid protection’ and ‘ hollow protection’ methods of fulfilling
the BSI requirements, together with recent applications of light-weight insulating materials. 


Probability and Experimental Errors in Science. By Lyman G. Parratt. (John Wiley & 
Sons, 1961.) [Pp. xv+255]. 55s. 


Lippmann’s well-known dictum “Everybody believes in the exponential law of errors:
the experimenters, because they think it can be proved by mathematics; and the mathe- 
maticians, because they believe it has been established by observation” reminds us that 
fifty years ago little had been done to synthesize the ideas of those to whom probability 
theory was a branch of pure mathematics and of those who viewed it as a set of empirical 
conclusions founded on experiment. Nowadays there is no basic confusion of thought 
about the normal law of errors or doubt about the validity of its application to numerous 
frequency distributions; but it is the case that, even at research level, the experimentalist 
is all too often woefully ignorant of the proper methods of analyzing the errors inherent 
in his measurements and incapable of designing his experiments so as to minimize these 
errors. The advent of the excellent book under review removes any possible excuse that the 
fault lies in the lack of suitable texts written at the right level and covering this important
field. 

Everything that the average experimenter needs to know about the subject is to be 
found in this volume. The treatment, which presupposes only a knowledge of elementary
algebra and calculus, is lucid, stimulating, and well-balanced. Altogether, a most useful 
publication. C. Horst. 


Special Relativity. By W. RINDLER. (Oliver & Boyd, 1961.) [Pp. x+186.] 10s. 6d.


The principle of the equivalence of inertial observers is consistent with all known 
facts of electromagnetism and of dynamics. Resting as it does on a sure experimental 
foundation, one would have imagined that no modern scientist could doubt its wide validity. 
The recurrent controversy about the so-called clock ‘ paradox’ demonstrates, however,
that there are still some who fail to appreciate the true interpretation and significance of
Einstein’s fundamental postulate: there is obviously a place in the world for Dr. Rindler’s 
cogent and elegant presentation of special relativity theory. In this book, the physical 
ideas underlying the subject are discussed with admirable clarity; the mathematical treatment 
is sound in every detail. 

The first three chapters (dealing with the Lorentz transformation, relativity kinematics
and relativity optics) demand nothing more than a logical mind and a knowledge of elementary
calculus; but the student new to the subject must be prepared to think hard, and to do
this with an open mind. In the remaining five chapters, the author employs a simple form
of 4-tensor calculus, so that the casual student will find the treatment less to his liking.
The book is, however, completely self-contained in that the necessary mathematical
techniques, which can be quickly acquired, are expounded very clearly in a short appendix.
A pleasing feature of the work is the provision of some 90 problems—some of them old
friends in disguise, but many original—together with answers and hints for solution.

Only in one respect, and this the relatively minor one of units, does Dr. Rindler fail to
maintain a high standard of accuracy throughout the book: the Maxwell equations and
certain other electromagnetic formulae are expressed not in terms of ...
units, as is stated to be the case, but in terms of (mixed) Gaussian units. C Honecail


SUBJECT INDEX (Vol. III)


Ampère as a contemporary physicist, 453.
Annual exhibition of the Institute of Physics 
and the Physical Society, January, 1962, 239. 


Biology and the molecular approach, 177. 

British Association Granada lectures, 1961, 297.

British Association meeting at Norwich, 1961,
152.


Cerenkov radiation: its origin, properties and 
applications, 45. 

Clean air, 81. 

Colour and perception, 91. 


Diploma in technology: courses in applied 
physics, 202. 
DSIR research establishments, 293. 


Electroluminescence, 112. 

Electromagnetism: second order effect, 28. 

Electromagnetism and electrostatic analogue, 
278. 

Electromagnetism for schools, 353. 

Electrons in metals, 241, 321, 401. 


Faraday’s law of induction and force in an 
electrostatic field, 286. 

Fermi surface, 214. 

Fourth international conference and exhibition 
on medical electronics, New York, 1961, 140. 


Harwell reactor school, science masters’ course 
No. 2, 240. 
High energy instrumentation, 303. 


Infra-red absorption spectra of semiconductors,
58.

Institute of Physics and the Physical Society 
conference on nuclear physics, 138. 

Interferometry, 212.

International colour meetings, 272. 


International conference on cosmic rays and 
the earth storm, Kyoto, Japan, 217. 
International conference on magnetism, Kyoto,
1961, 222.
Isotope separation, 304. 


Lasers, 435. 


Masses of nuclei, 299.
Metals, looking at, 1. 
Metre, the, 415. 


NPL “ microwave mekometer ’’, 290. 
Ozone in the earth’s atmosphere, 375. 


Philosophical basis of physics, 382. 

Physical aspects of lubrication and wear, 257. 

Physical Science Study Committee (4): working 
session on physics teaching at the Cavendish 
Laboratory, Cambridge, 368. 

Physics applied to archaeology, 161, 334. 

Proceedings of the 1960 annual international 
conference on high energy physics at 
Rochester, N.Y., 80. 


Royal Society conversazione, May, 1961, 125. 

Rutherford High Energy Laboratory of the 
National Institute for Research in Nuclear 
Science, 145. 

Rutherford in Manchester, 155. 


Science and mathematics teaching, 292. 

Science Masters’ Association: annual meeting, 
395. 

Solids at high pressures, 134. 

Time and relativity, 16, 194. 


Wool industries research association, 75. 


AUTHOR INDEX (Vol. III) 


AITKEN, M. J., physics applied to archaeology,
Part I, 161; Part II, 334. 


BALLENTYNE, D. W. G., electroluminescence, 
112. 

BARRELL, H., the metre, 415.

BLOW, D. M., the molecular approach to
biology, 177. 


Book Reviews: 

AITKEN, M. J., Physics and Archaeology, 
159. 

ALLER, L. H., The Abundance of the
Elements, 315.

ANDREW, H. C., On Teaching Voltaic Cells
and Electrolysis, 399.

Aurores et Ciel Nocturne. J. Paton, 309.

AVERY, J. H. and INGRAM, A. W. K.,
Laboratory Physics, 160.

BALDIN, A., GOL’DANSKII, V. I. and
ROZENTAL, I. L., Kinematics of Nuclear
Reactions, 320.

BIRD, G. I., Protection of Structural Steel
Against Fire, 480.

BIRKHOFF, G., Hydrodynamics, 309.

BUTLER, S. T. and MESSEL, H. (Eds.), Space
and the Atom, 160.

BRITTIN, W. E., DOWNS, B. W. and DOWNS,
J. (Eds.), Lectures in Theoretical Physics,
308.


CARO, D. E., McDONELL, J. A. and SPICER,
B. M., Modern Physics, 310.

Climatic Data from Dumont D’Urville. 
J. M. Stagg, 313. 

COPELAND, P. and BENNETT, W. E., Elements 
of Modern Physics, 312. 

De Witt, C. (Ed.), Dispersion Relations and 
Elementary Particles, 319. 

EISBERG, R. M., Fundamentals of Modern
Physics, 398.

FARAGO, P. S., An Introduction to Linear
Network Analysis, 399.

FERRARO, V. C. A. and PLUMPTON, C., An
Introduction to Magneto Fluid
Mechanics, 319.

HEIKES, R. R. and URE, R. W., Thermo-
electricity—Science and Engineering, 308.

HILL, J. F., Textbook of Reactor Physics,
400.

International Dictionary of Physics and 
Electronics. G. R. Noakes, 314. 

Jay, K., Nuclear Power Today and To- 
morrow, 320. 

JOLLY, W. P., Physics for Electrical
Engineers, 316.

KLEIN, L. (Ed.), Dispersion Relations and
the Abstract Approach to Field Theory,

KOLTHOFF, I. M. and ELVING, P. J. (Eds.),
A Treatise on Analytical Chemistry, 316.

LAND, F., The Language of Mathematics,

LANCZOS, C., Linear Differential Operators,

Les Recherches Galactiques et Extra-
galactiques et la Photographie Elec-
tronique. M. Johnson, 310.

LLOWARCH, W., Ripple Tank Studies of
Wave Motions, 318.

Magnétisme Terrestre. J. M. Stagg, 315.

MARCKWORTH, M. L. (Ed.), Dissertations in
Physics, 398.


MARSHAK, R. E. and SUDARSHAN, E. C. G.,
Introduction to Elementary Particle
Physics, 313.

Météorologie: Données climatiques à la
station Charcot et à l’intérieur de la
Terre Adélie. J. M. Stagg, 317.

Moore, C. K. and Spencer, K. J., Elec- 
tronics. A Bibliographical Guide, 313. 

Notes on Applied Science. G. R. Noakes, 
Sai 

PARRATT, L. G., Probability and Experi-
mental Errors in Science, 480.

Proceedings of the International Symposium 
on Polarization Phenomena of Nucleons. 
R. J. Blin-Stoyle, 314. 

RINDLER, W., Special Relativity, 480. 

Ritson, D. M. (Ed.), Techniques of High 
Energy Physics, 240. 

ROGERS, E. M., Physics for the Enquiring
Mind, 399.

ROSE, D. J. and CLARK, M., Plasmas and
Controlled Fusion, 318.

Science Masters’ Book—Physics. R. W. 
Moore, 319. 

Science Study Series, 159. 

Standard and Rapid-Run Magnetograms.
J. M. Stagg, 311.

TEMPERLEY, H. N. V., Properties of Matter, 
318. 

THOMSON, SiR GEORGE, The Inspiration of 
Science, 307. 

TOBOCMAN, W., Theory of Direct Nuclear
Reactions, 320.

WILKS, J., The Third Law of Thermo-
dynamics, 307.

WOLF, E. (Ed.), Progress in Optics, 316.

WRIGHT, J. K., Shock Tubes, 315. 


BROCKLEBANK, R. W., two international colour
meetings, 272.

BROCKLEBANK, R. W., see Wilson, M. H.
BUCHDAHL, G., the philosophical basis of
physics, 382.


CHALMERS, J. A., the laws of electromagnetism
and an electrostatic analogue, 278.

CROOK, A. W., some physical aspects of
lubrication and wear, 257.


Essay Reviews: 


FLÜGGE, S. (Ed.), Cosmic Rays, 477.

HARRISON, W. A. and WEBB, M. B. (Eds.),
The Fermi Surface, 214.

Interferometry. H. G. Kuhn, 214.

LONDON, H. (Ed.), Separation of Isotopes,


Proceedings of an International Conference 
on Instrumentation for High-Energy 
Physics. C. C. Butler, 303. 

Reports on Progress in Physics, Vol. 24. 
Dek. Gibbss1/50! 


FRISCH, O. R., time and relativity: Part I, 16;
Part II, 194.

FRITH, R., ozone in the earth’s atmosphere,
375.


GEAKE, J. E., Rutherford in Manchester, 155.
GRIFFITH, M. V., the British Association
meeting at Norwich, 152.




HARLOW, D. W., the Science Masters’
Association annual meeting, 395. 

Hay, E. J., electromagnetism for schools, 
based on M.K.S.A. units. Part I: the 
principles, 353. 


INGRAM, D. J. E., lasers, 435.


JELLEY, J. V., Cerenkov radiation: its origin, 
properties and applications, 45. 


Lee, E. W., Kyoto 1961. The international 
conference on magnetism, 222. 


MARSDEN, P. L., the international conference
on cosmic rays and the earth storm, Kyoto, 
Japan, 217. 

Martin, J. W., looking at metals: a lecture for 
schools, 1. 

MENDELSSOHN, K., sciences in communist 
China, 474. 


NIGHTINGALE, A., report on the fourth inter- 
national conference and exhibition on 
medical electronics, New York, 1961, 140. 


PARKINSON, D. H., some aspects of the infra-
red absorption spectra of semiconductors,
58.

PAUL, E. B., masses of nuclei, 299.

PENTLAND, N., the diploma in technology: 
courses in applied physics, 202. 


RAMANATHAN, K. G., Faraday’s law of
induction and the force on a body due to
change in its magnetization in an electric
field, 286.


Rosser, W. G. V., electromagnetism as a 
second order effect: III, the Biot-Savart 
law, 28. 


SALAM, A., the proceedings of the 1960
Annual International Conference on high 
energy physics at Rochester, N.Y., 80. 

SAUNDERS, V. T., the Wool Industries
Research Association, 75; the NPL “microwave
mekometer”, 290; DSIR research
establishments, 293; the British Association 
Granada lectures 1961, 297; The Royal 
Institution Christmas Lectures, 1961, 469. 

STEVENS, S. H. G., clean air, 81.


TRICKER, R. A. R., Ampère as a contemporary
physicist, 453. 


URI HABER-SCHAIM, the Physical Science
Study Committee. (4): the working session 
on physics teaching at the Cavendish 
Laboratory, Cambridge, 368. 


WILSON, M. H. and BROCKLEBANK, R. W.,
colour and perception: the work of Edwin 
Land in the light of current concepts, 91. 


ZIMAN, J. M., electrons in metals: a short 
guide to the Fermi surface. Part I: the 
electron gas, 241; Part II: bands and zones, 
321; Part III: dynamics of Bloch electrons, 
and the calculation of band structure, 401. 


Printed by WARREN AND SON, LTD.
(a subsidiary of Taylor and Francis Ltd.)
The Wykeham Press, Winchester, 
England. 


